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A Note to the Reader 


This book does not follow the design of a traditional math textbook. There 
are very few complete proofs included, and the exercises are not listed at the 
end of each section. Instead, this text is an invitation to ponder, question, 
create, and figure out on your own some beautiful mathematical results in the 
field of knot theory. Exercises are sprinkled in between statements of 
definitions, descriptions, and propositions. Some exercises are designed to 
introduce you to new ideas and to point you in the direction of why or how an 
idea is important. Other exercises guide you through the technical and subtle 
arguments that provide the foundation for our understanding of knots and 
links. Answers are rarely provided in full, but the path toward a solution or 
proof is illuminated. 


As indicated by the word interactive in the title, this book is meant to be read 
with paper and pencil (colored pencils, string, and pipe cleaners, too!) in hand 
so that you can jot down your ideas, explore how a new definition applies to 
your favorite examples, find the answer to a question, or prove a theorem. As 
you are reading the text, we hope you will be in a space where conversations 
and collaborations with others are readily available. 


A Note to the Instructor 


We designed this book for a student-centered classroom where students 
regularly spend class time working collaboratively on problems, presenting 
solutions, and vetting arguments made by their peers. We have used the 
textbook both in a 10-week quarter and in a 15-week semester. Our classes 
typically have 10-20 undergraduate students, though this book could be used 
by a larger class of up to 30 students or even by a single group of 2-3 
undergraduate research students. Indeed, we have successfully used parts of 
the book to introduce summer research students to the fundamental ideas in 
knot theory. Prerequisites for this book include a course on proof writing 
(which includes exposure to ideas like sets and modular arithmetic) and proof 
techniques (e.g., proof by induction) as well as a course on linear or matrix 
algebra. 


Chapter 1 


Playing & Building Intuition 


Each section in Chapter 1 is a hands-on introduction to knots, links, and 
equivalence. In this chapter, we will learn about several foundational 
concepts in the study of knots. The formal definition of a knot is postponed to 
Chapter 2 to allow time to first play and build intuition within the 
mathematically rich and beautiful field of knot theory. 


Informally, a knot is a closed loop in space. The term closed loop means that 
the loop has no loose ends, and no beginning or ending points. You can think 
of a knot as a knotted-up circle made of string or wire. A link is a collection 
of closed loops in space and the number of loops is called the number of 
components of the link. A link can have one component. Thus, knots are just 
special types of links having only one component. Note that, when we use the 
term ‘link’ in this book, we are generally referring to both knots and links 
with more than one component. 


The examples of knots and links seen in Table 1.1 are flat drawings of 3- 
dimensional loops in space. A 2-dimensional drawing of a link is called a 
diagram of the link. In a diagram, the term crossing is used to describe a 
location where one portion of the link passes over another portion of the link. 
Crossings are identified by a short break in the drawing of the curve, which 
indicates that this portion of the curve is passing under the unbroken portion 
of curve. 


Two links are called equivalent if they have the same number of components 
and they can be physically manipulated in space (rotated, bent, twisted, 
stretched, etc.), without cutting, so that the first link is transformed identically 
into the second. We imagine that our strings are highly elastic so they can be 
scaled up or down in size, stretched and contracted. 


While playing and building your intuition with the activities in this chapter, 
you may come up with your own questions or conjectures. We encourage you 
to collect and write down your ideas and add them to the list of questions in 
Section 1.8. Perhaps you will create your own new open research question 
about knots or perhaps you will stumble upon the same questions that the 
founders of the field of knot theory have puzzled over for years. 


One last note before you begin. In mathematics, formal proofs of theorems 
rely on concepts and constructions being formally defined. Since the formal 
definition of a knot is not given in Chapter 1, we will not be asking the reader 
for formal proofs in this chapter. Instead, we ask the reader to give an 
argument that a statement is mathematically valid. We specifically use the 
term ‘argument’ rather than ‘proof to allow for what we recognize is a 
tension between the desire for a formal proof with an initial lack of the formal 
definitions that are needed to make such a proof rigorous. An argument, in 
this setting, may be viewed as being less formal than a proof, but it should be 
as clear and complete an explanation as possible. In Chapter 2, you will 
notice a shift from play to formalism. Formal proofs related to the definition 
of a knot or link will serve as the foundation both for our play and for our 
proofs in the remainder of the book. 


Table 1.1: Examples of Link Diagrams. 
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1.1 Projections, Diagrams & Equivalence 
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Exercise 1.1.1. (a) Identify the knots in Table 1.1. Build them with pipe 
cleaners, and determine any equivalences between different pictures. Record 
your findings and conjectures. (b) Identify the links with more than one 
component in Table 1.1. Build them with pipe cleaners, and determine any 


equivalences between different pictures. Record your findings and 
conjectures. 


Given a link L in space and a light source some distance away, the shadow of 
the link made on a plane across from the light source is called a projection or 
shadow of the link. Projections can look similar to the pictures in Table 1.1, 
but they are missing information about which is the under-strand and which 
is the over-strand. The curve intersections in a projection are called 
precrossings. A precrossing is said to have been resolved once we have 
selected the crossing information (that is, we have specified which strand 
passes over and which passes under at the crossing). Once all crossing 
information is determined in a link projection, the image is then a link 
diagram. 


Figure 1.1.1: A knot projection in a plane, the knot in space, and a light 
source. 
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The projection of a link L onto two distinct planes in R* can result in 
strikingly different images. Some projections are nonstandard and cannot be 
used to recreate the link in space, even if crossing information is included. 
While such projections may be useful in certain situations, they are not the 
projections that knot theorists typically study. (A list of specific projections 
characteristics to avoid will be studied in Chapter 2.) Two different 
projections of a link, L, can also have vastly different numbers of crossings. 
The following two exercises investigate the relationship between a link, its 
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numerous projections, and the diagrams stemming from those projections. 


Exercise 1.1.2. (a) Make a knot out of rigid material such as wire. Draw a 
projection of that knot from two perspectives that result in significantly 
different projections. (b) For that same knot, draw a nonstandard diagram of 
your knot from which the knot cannot be reconstructed. (For instance, maybe 
there are places in your diagram where several strands are tangent to each 
other or where three or more strands of the knot intersect at a single point.) 


Exercise 1.1.3. Create a single rigid knot K in space such that one projection 
of K results in a diagram with four crossings and another projection results in 
a diagram with zero crossings. 


Exercise 1.1.4. The knot labeled (a) in Table 1.1 is called the unknot or the 
trivial knot. Give an argument explaining why any knot diagram with 
exactly one or exactly two crossings must be equivalent to the unknot. (Hint: 
Draw the crossing(s) first, and then make a knot by connecting the ends in all 
possible ways so that no more crossings are created. Approach this task 
systematically so that your argument shows, without a doubt, that all possible 
ways have been considered.) 


We’ ve just seen that there are no nontrivial knots that can be drawn with just 
one or two crossings. In fact, the smallest nontrivial knot is a knot that can be 
drawn with three crossings. Any knot that can be drawn with three crossings 

and no fewer is typically referred to as a trefoil knot. 


1.2 Crossing and Unknotting Numbers 


Some of the links in Table 1.1 can be manipulated in space and then redrawn 
with fewer crossings. The crossing number of a link L is the minimum 
number of crossings needed in a diagram of L. For instance, if the crossing 
number of L is five, then it is impossible to draw a diagram of L that has four 
or fewer crossings. 


Exercise 1.2.1. For each link in Table 1.1, make a conjecture about the 
crossing number of the link. Can you provide an argument in support of any 
of your conjectures? Which conjectures do you not yet have enough tools to 


prove? 
Exercise 1.2.2. Determine the crossing number of the knot in Figure 1.2.1. 


Figure 1.2.1: A complicated knot? 
Exercise 1.2.3. Consider the knot projection in Figure 1.2.2. Start with an 
arbitrary precrossing of P and resolve it into a crossing. Imagine grasping the 
over-strand and pulling this strand up out of the plane of the paper. Using this 


visualization as inspiration, show for your projection that the remaining 
precrossings can be resolved so that the resulting knot is the unknot. 


Figure 1.2.2: A knot projection. 


Exercise 1.2.4. Use the example in Exercise 1.2.3 as a guide to write an 
argument that given any knot projection, the precrossings can be resolved to 
produce a diagram of the unknot. (Hint: There are two parts to this problem. 
First you must devise an algorithm to create the desired unknot diagram. 
Then you must argue that the resulting diagram actually is the diagram of the 
unknot.) 


Let K be a knot. Using your argument in Exercise 1.2.4, we can show that, 
given any diagram D for K, some number of crossings can be changed in D to 
produce a diagram of the unknot. Note that changing a crossing in a knot 
diagram is like passing the corresponding knot through itself in space. 
Typically, this “passing through” operation results in a different type of knot. 
The minimum number of times the knot must pass through itself before it 
becomes unknotted is called the unknotting number of the knot. 


Note that the definition of unknotting number is a spatial characteristic of the 
physical knot. Thus, the unknotting number is not tied to a particular diagram 
of the knot. We can use diagrams to build intuition about what a knot’s 
unknotting number might be, but such intuition does not always constitute a 
proof that a given knot has a specific unknotting number. For instance, if we 
can show that changing two crossings in a diagram of a knot K produces the 
unknot, we can say that the unknotting number of K is at most two. We 
cannot, however, claim that the unknotting number of K is exactly two 
without more work. Perhaps there’s another diagram of the knot that only 
requires one crossing change to become unknotted. 


Exercise 1.2.5. Use the diagrams provided to conjecture the unknotting 
number of the knots labeled (b), (f), and (0) from Table 1.1. Experiment with 
alternate diagrams for these knots to give additional support for your 
conjectures. 


We will take a closer look at the unknotting number and study interesting 
ways to unknot diagrams in Chapter 7. 


1.3 Alternating Knots 


A diagram D for a link L is called an alternating diagram provided that, if 


you traverse each component in the diagram, then you alternately pass over 
and under the crossings. For example, the diagram (b) from Table 1.1 is 
alternating, while the diagram (1) is not. A link L is called an alternating 
link provided that there exists an alternating diagram D for L. 


Exercise 1.3.1. Determine which links in Table 1.1 are alternating links. 
Record your findings. (Note that there are many nonalternating diagrams that 
can be created of a given alternating link, so just because a particular diagram 
is not alternating does not imply that no diagram of that link is alternating!) 


Exercise 1.3.2. Choose one of the alternating links you found in Exercise 
1.3.1 that has an alternating diagram in Table 1.1. Produce an equivalent, 
nonalternating diagram of this link. 


Exercise 1.3.3. Consider the knot projection in Figure 1.2.2. Show that the 
precrossings can be resolved in such a way that the diagram becomes an 
alternating diagram. Investigate whether this can always be done for any knot 
projection. Can you find a knot projection for which there is no possible 
selection of crossings that will result in an alternating diagram? Record your 
findings, arguments, and conjectures. 


The following three exercises relate to finding the crossing number of an 
alternating diagram. 


Exercise 1.3.4. Consider the alternating diagrams in Table 1.1. For each 
alternating diagram, can you produce an equivalent diagram for this link that 
has fewer crossings? If so, is the resulting diagram alternating? Record your 
findings. 


Exercise 1.3.5. Consider the alternating knots in Figure 1.3.1. For each 
alternating knot, can you produce another diagram for this knot that has fewer 
crossings? If so, is the resulting diagram alternating? Record your findings. 


Exercise 1.3.6. Suppose K is an alternating knot. Use the exercise above to 
State a conjecture regarding any diagram of K that has a minimum number of 
crossings. Are there required properties for the diagram of K? Will a diagram 
of K with minimum crossings always be alternating? Create your own new 
examples to give further evidence in support of your conjecture. 


We will revisit your conjecture and provide a proof determining the crossing 
number for alternating links in Chapter 6. 


Figure 1.3.1: Examples of alternating knots. What is the crossing number of 
each knot? 
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1.4 Games with Knots 


Given a projection of a knot, we can play a game called the Knotting- 
Unknotting game. Suppose two players, Kenya and Ulysses, take the knot 
projection in Figure 1.4.1 as their starting “game board.” 


Figure 1.4.1: A game board for the Knotting-Unknotting game. 


The players take turns resolving precrossings until all crossing information 
has been determined. Once a crossing has been resolved, it cannot be 
changed. Kenya’s goal is to create a nontrivial knot, while Ulysses’ goal is to 
make the unknot. 


Exercise 1.4.1. Find a friend and decide who will play the part of Kenya and 
who will play the part of Ulysses. Next, decide who will play first and who 
will play second. Play the game on the projection in Figure 1.4.1. Who won? 
Did the winner seem to have an unfair advantage? 


Exercise 1.4.2. Play the same game as in the previous exercise, but switch 
who goes first. Who won? Did the winner seem to have an unfair advantage? 


Exercise 1.4.3. Now draw your own projection and play another round of the 
Knotting-Unknotting game. Describe any strategies you uncover for how 
each player should play the game. 


Exercise 1.4.4. Is there a knot projection on which Ulysses (the unknotter) 
can always win, regardless of whether he plays first or second? 


Exercise 1.4.5. Is there a knot projection on which Kenya (the knotter) can 
always win, regardless of whether she plays first or second? 


Exercise 1.4.6. Invent and explore your own game that can be played ona 
knot diagram or a projection of a knot diagram. 


1.5 Mirrors, Orientation & Inverses 


The mirror image of a link diagram D, denoted D", is the link obtained by 
changing all of the crossings of D. In other words, each crossing over-strand 
becomes the under-strand of that crossing and vice versa. 


Exercise 1.5.1. Let D be a link diagram and D* be obtained by reflecting D 
through a mirror. Explain why D* and D” result in equivalent links. (Use the 
description of equivalence of knots and links that was given at the beginning 
of this chapter.) 


A link is called chiral if it is not equivalent to its mirror image. If it is 
equivalent to its mirror image, it is called amphichiral or achiral. 


Exercise 1.5.2. Investigate whether the knots (c), (d), (n), and (0) in Table 
1.1 are chiral or achiral. Record your findings and conjectures. 


A link can be given an orientation by simply assigning a direction of travel 
around each loop. Orientation is typically indicated by drawing one or more 
small arrowheads as shown in Figure 1.5.2. 


Figure 1.5.1: A knot and its mirror image. 
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Figure 1.5.2: Example of an oriented knot. 
A) ‘Sa 


For an oriented knot diagram D, the same diagram with the opposite 
orientation is called the reverse of D, denoted f. For some links, the choice 
of orientation does not matter. That is, the oriented link is equivalent to its 
reverse. A link with this property is called invertible. 


Exercise 1.5.3. (1) Determine whether or not the links in Table 1.1 are 
invertible. (2) Create your own knot with seven or fewer crossings and 
determine whether or not your knot is invertible. 


1.6 Knot Composition & Prime Knots 


Given two link diagrams, we can create a new link by removing a small arc 
from each diagram and then connecting the four endpoints by two new arcs, 
as in Figure 1.6.1. For two links J and K, the new link is called the 


composition of J and K and is denoted J#K. 


A knot is called a composite knot if it can be expressed as the composition 
of two nontrivial knots. The knots that make up the composite knot are called 
the factor knots. 


Figure 1.6.1: The composition of K and L. 
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Exercise 1.6.1. Determine the result of composing the unknot with a link L. 
Record your findings. 


If a knot is not the composition of any two nontrivial knots, then it is called a 
prime knot. 


Exercise 1.6.2. The knots in Figure 1.6.2 are composite. Make each knot out 
of string. Play with your physical knots to identify their prime factor knots. 


Figure 1.6.2: Examples of composite knots. 


1.7 Knot Notation 


Suppose you are studying an alternating knot diagram while having a phone 
conversation with a friend about some interesting aspect of your diagram. 
Suspend disbelief to further suppose that your friend doesn’t have a 
smartphone or any other device that would allow for a picture of your 
diagram to be sent to her. How can you give her instructions over the phone 
to recreate your diagram? Is there a systematic way that you can encode or 
describe the diagram and crossing information so that she can reconstruct 
your picture? 


Exercise 1.7.1. Work with a friend to create notation for encoding alternating 
knots that can be used in non-pictoral communication with a person or a 
computer. Your notation should be easy to determine; that is, given a knot 
diagram, there should be a relatively straightforward algorithm to follow to 
determine the notation for that particular diagram. Your notation should also 
be easy to use; that is, it should allow someone to recreate the diagram of the 
knot that was used to determine the notation (or recreate an equivalent 
diagram) with no additional information other than your notational 
conventions. 


Exercise 1.7.2. Investigate any limitations of the notational scheme you 
created above by working with the same friend and following the steps 
provided below. Record and summarize your findings. 


Step A. 


1. Sit back-to-back with your friend. Each of you should have paper and 
pencil in hand. 


2. Secretly select one of the alternating knot diagrams from Table 1.1 and 
use your notation to encode this knot. 


3. Pass your notation only (no figures or references to Table 1.1) to your 
friend. They will then pass their notation to you. 


4. Decode the notation given to you by drawing the diagram that the 
notation suggests. 


D: 


6. 


Tum around to compare your diagram with their selected diagram from 
Table 1.1. Is your decoded diagram the same as or equivalent to the 
diagram that they encoded? Why or why not? 


Record and summarize your findings. 


Figure 1.7.1: Use this diagram in Step B. 


Step B. 


1. 


Sit back-to-back with your friend. Each of you should have paper and 
pencil in hand. 


. Draw the knot in Figure 1.7.1 and encode it using your notation. 


. Are there any choices made in the determination of your notation from 


a diagram? If so, make these choices in various ways to create a list of 
notations describing the diagram. 


. Exchange notations with your friend. 
. Using your friend’s list of notations, draw each knot diagram. 
. Are all of the drawn diagrams equivalent? Why or why not? 


. Are there any choices you had to make when recreating the diagram 


from the notation? What are they? Do the various choices always result 


8. 


9. 


in equivalent knots? Why? 
Repeat these steps for knot (b) from Table 1.1. 


Record and summarize your findings. 


Step C. 


1. 


2; 


Draw your own alternating knot diagram with five or fewer crossings. 
Encode this diagram using the notational scheme you found above. 


Figure 1.7.2: A twist of an arc of the diagram. 
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. Now twist one arc in your diagram so the result is a new alternating 


diagram that contains one new crossing. Figure 1.7.2 shows what is 
meant by the word ‘twist.’ 


Encode this new diagram using your notation and swap the notation for 
both diagrams with your friend’s notation. 


Decode the notation given to you by your friend. Are the two diagrams 
equivalent? 


Is there a recognizable characteristic within your friend’s notation that 
could be used to identify where the twisting is encoded? Encode and 
decode several more examples to formulate and test your conjectures. 


Exercise 1.7.3. You just designed a method for encoding alternating knots. 
Could your notation be modified to encode information about nonalternating 
knots as well? 


In Chapter 4, we will learn about and investigate several ways to encode 
knotting information. 


1.8 Questions in Knot Theory 


Some of the most difficult questions in knot theory are very easy to state. 
Here are a few questions for us to ponder regarding the basic ideas from this 
chapter. 


1. Given knots K and J, how can we determine if K is equivalent to J? 


2. How might we prove that two knots are not equivalent? In particular, 
how can we prove that a given nontrivial knot is actually nontrivial? 


3. How can we determine the crossing number of a knot? 
4. How can we determine the unknotting number of a knot? 


5. How can one produce a table of the simplest knots? How should the 
complexity of a knot be measured? 


6. Which knots are amphichiral? 
7. Given a knot K, how can we determine if K is prime? 


8. Is there a definition of an alternating knot that relies only on a knot’s 
position in space and not on any diagrams of the knot? 


9. Which knots are invertible? 


Exercise 1.8.1. Add your own questions to the list above. 


Chapter 2 


Definition and Equivalence of Knots and 
Links 


So far, we’ve been exploring knot theory using intuitive ideas about what 
knots and links are and when they are equivalent. In this chapter, we will 
formalize these ideas. We begin with a definition of knots and links in R* 
using collections of line segments. 


2.1 Polygonal Curves & A-Equivalence 


Definition 2.1.1. If A, B are points in R*, we use AB to denote the line 
segment from A to B. For an ordered set of distinct points, (A;, A>, A3,..-, 


A,), with A, « R° for all i, the union of the segments A,A>, A,A3, ... A, 1A), 
A,A, is called a closed polygonal curve in 2°. If each segment intersects 


exactly two other segments, intersecting only at an endpoint, then the curve is 
called simple. We call the points A; the denning points or vertices of the 


polygonal curve. 


Definition 2.1.2. A knot is a simple closed polygonal curve in g°. A link, L, 
is a finite union of pairwise nonintersecting knots. The number of knots in the 
union comprising L is called the number of components of the link. Thus a 
knot is a link with exactly one component. 


Figure 2.1.1: Example of a knot. 
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Figure 2.1.2: Example of a link with three components. 


Definition 2.1.3. The two-dimensional pictures of links, like those seen in 
Figures 2.1.1 and 2.1.2, are called link diagrams. A link diagram comes 
from a particular type of projection (in the standard geometric sense) of the 
link onto a plane in jR%. At each point where the projection of two line 
segments from L results in a point of intersection in the projection plane, one 
of the two segments is drawn as broken, or as if a piece of the line segment is 
missing. The broken segment is assumed to be passing under the unbroken 
segment. The places in a diagram where a segment passes under another is 
called a crossing of the diagram. 


The projection of a link L onto two distinct planes in * can result in 
strikingly different images, and some projections cannot be used to recreate a 
knot in space unless more information is provided. Such projections should 
be avoided for the purposes of our discussion. In the exercise below, we 
investigate projections and create a succinct list of properties to avoid. 


Exercise 2.1.4. Make a knot out of straws and tape with 9 straws. Complete 
the four steps below. 


1. Draw a projection of your knot that looks as if only 8 straws were used 
in the knot. 


2. For the same knot, draw several projections from which the knot 
cannot be reconstructed by simply indicating crossing information. 
Draw any other projections you encounter that contain diagrammatic 
oddities. 


3. Make a list of all properties of projections that should be avoided when 
drawing a link diagram. 


4. Compare your list with a friend’s list to create a complete and succinct 
list of Properties to Avoid. 


Definition 2.1.5. A regular projection of a link L is a projection that has 
none of the Properties to Avoid from Exercise 2.1.4, part 4. A link diagram 
for L is a regular projection of the link L that includes all crossing 
information (i.e., information about which strand passes over and which 
passes under at each crossing). 


It is not immediately obvious that all links have regular projections. 
Sometimes a link must be manipulated in j* before a regular projection can 
be found. However, given any link L, there will always exist a link diagram 
that represents either L or a link equivalent to L. 


Definition 2.1.6. Let A and B be two adjacent defining points of a link L in R°® 
. Suppose C is a point in jg° such that the triangle AABC and its interior 
intersect L only along the line segment AB. We call such a triangle, AABC, an 
elementary triangle. An elementary triangle can be a degenerate triangle 
with C © AB. 


Under these conditions, replacing the line segment AB by the segments AC U 
CB results in a new link, L’. This process of replacement is called an 
elementary move applied to L resulting in L’. The inverse process of 
removing AC U CB and replacing these two segments with the segment AB, 


assuming that the triangle AABC and its interior intersect the link only along 
the segments AC U CB, will also be referred to as an elementary move. 


When applying an elementary move AABC to a diagram of a link, we always 
select a point C that both satisfies the definition of an elementary triangle and 
results in a link diagram for L’. For if a point @ ¢ R* is selected such that the 
resulting diagram is not regular, then the projection plane or the point C can 
be altered slightly to result in a projection to a link diagram. 


Figure 2.1.3: Example of an elementary move applied to L resulting in L’. 
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Definition 2.1.7. Two links, L and L’, are called A-equivalent or delta 
equivalent, denoted ;, 4 ;’, provided there exists a sequence of links L = L,, 


L,,...L, =L' such that each L,,, is obtained from L; via an elementary 
move, foralli=1,...n—1. 


Exercise 2.1.8. Prove that A-equivalence is an equivalence relation on the set 
of links. In other words, prove that 4 is a reflexive, symmetric, and transitive 
relation. 


Exercise 2.1.9. Prove that the knots K, and K>, shown in Figure 2.1.4, are A- 


equivalent. (See Figure 2.1.5 for examples of elementary moves applied to 
K,.) 


Figure 2.1.4: Example of equivalent knots. 
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Figure 2.1.5: A sequence of elementary moves applied to K,. The red 


segments outline the elementary triangle and indicate the new line segment(s) 
that join the diagram after the elementary move is applied. Each arrow, 
except for one, is an equivalence resulting from a single elementary move. 
Can you spot the arrow that is the result of two elementary moves? 
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Exercise 2.1.10. Determine which equivalences shown in Figure 2.1.5 are the 
result of more than one elementary move. 


2.2 Diagram Equivalence via R-Moves 


As we Saw in Exercise 2.1.9, creating and visually following a sequence of 
elementary moves is not always easy. In this section, we develop an alternate 
but equivalent notion of link equivalence that is generated by a small list of 
localized changes to a link diagram. To find this list of small changes, we 
consider subdivisions of elementary triangles into ‘less complicated’ 
elementary triangles. To make this more concrete, let’s look at an example. 


Consider the elementary triangle in (a) from Figure 2.2.1 and the subdivision 
shown in (b). Each subtriangle in the subdivision contains at most two line 
segments of the link diagram. If we can show that for any elementary triangle 
such a subdivision can always be found, then we can redefine link 
equivalence in terms of a sequence of elementary moves using only our ‘less 
complicated’ elementary triangles from a small finite list. This reframing of 
link equivalence will prove to be very powerful in Chapter 5 where we study 
combinatorial properties of links. 


Figure 2.2.1: An elementary triangle AABC and a subdivision. Each sub- 
triangle of the subdivision is itself an elementary triangle. The elementary 
move AABC can also be performed by applying all the elementary moves in 
the subdivision one at a time, starting along the segment AB and moving 
upward toward C. (Check this! Several of the elementary moves must use 
degenerate triangles.) 
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Our goal is to create a short list of simple, localized, elementary moves that is 
long enough so that any elementary triangle can be subdivided into pieces on 
our list. Do you see the tension in this goal? A short list that is long enough. 
In this section, we look to Figure 2.2.1 for inspiration of what to put on our 
list of moves, but there is no guarantee that this example subdivision contains 
all the moves that we will need. (In fact, it does not!) Once our list is 
seemingly complete, then we must prove that our list is long enough so that 
any elementary triangle can be subdivided into moves on our list. 


The first additions to our list of localized elementary moves are a collection 
of simple moves that do not change the number of crossings in a link 


diagram. Examples of these moves, called planar isotopies, are shown in 
Fienre 2.2.2. 


Definition 2.2.1. Let AABC be an elementary triangle in a diagram D that 
contains the link segment AB. The elementary move AABC is called a planar 
isotopy if it satisfies one of the following three conditions. 


(i) In the link diagram, AABC coincides with no points of the link other 
than the segment AB. 


(ii) In the link diagram, AABC and the segment AB lie under (or over) the 
interior of exactly one line segment that is not adjacent to AB. If the 
line segment crosses under the segment AB (resp., over AB), then it 
crosses under (resp., over) either AC or CB after replacement. 


(iii) In the link diagram, AABC and the segment AB lie under (or over) 
exactly two adjacent line segments and their shared vertex. One of the 
two segments crosses Over (resp., under) the segment AB, thus the 
other segment crosses over (resp., under) either AC or CB after 
replacement. 


The inverse of a move of type (i)-(iii) is also a planar isotopy. A sequence of 
several planar isotopies is also considered to be a planar isotopy. 


The dotted circle in Figure 2.2.2 indicates that the move is taking place 
locally, i.e., within a small region of the diagram and the diagram is 
unchanged outside this region. 


Figure 2.2.2: Planar isotopies of types (i), (ii), and (iii). 
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Exercise 2.2.2. Number all the subtriangles in Figure 2.2.1 (b) to indicate an 
order in which they can be applied. (There are many valid ways to do this.) 
For your ordering, determine the number of subtriangles that are planar 
isotopies of type (i), determine how many are of type (ii), and find how many 
are of type (iii). Check with a friend to see if your numbers are the same. 
Your numbers might be different depending on the order in which the 
subtriangle moves are made. 


Exercise 2.2.3. Let L, and L» be the links in Figure 2.2.3. Provide a sequence 
of planar isotopies from L, to L5. 


Figure 2.2.3: The link L, can be obtained from link L, via a sequence of 
planar isotopies. 
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Several of the subtriangles in Figure 2.2.1 (b) are not planar isotopies because 
they change the number of crossings in the diagram. Therefore, we must add 


to our list of localized elementary moves a few simple moves that do change 
the number of crossings. Figure 2.2.4 contains two examples of elementary 
moves that increase or decrease the number of crossings in the diagram by 
one. We add these elementary moves to our list. 


Figure 2.2.4: This elementary move, called an R1 move, changes the number 
of crossings in the diagram by one. 
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Definition 2.2.4. Let AABC be an elementary triangle that lies over or under 
a single edge and that edge is adjacent to an edge that is to be replaced (as is 


the case in Figure 2.2.4). Then the elementary move AABC is called an R1 
move. 


Exercise 2.2.5. Find at least one example of an R1 move subtriangle in the 
subdivision shown in (b) of Figure 2.2.1. 


Before adding more moves to our list, let’s eliminate from consideration a 
special type of elementary triangle that an R1 move is designed to address. 
The next two lemmas will help when proving that our list is “long enough” 
because they allow us to focus our proof on certain “nice” elementary 
triangles. 


From the definition of a link, there are two line segments that emanate from a 
segment AB, and there are two possibilities of how an elementary triangle 
might interact with the two line segments of the diagram that emanate from 
AB. The elementary triangle AABC either (1) doesn’t coincide with the edges 
emanating from AB; or (2) coincides with at least one of the edges emanating 
from AB. The simpler situation of type (1) is pictured in Figure 2.2.5, where 
neither of the edges emanating from AB lies above and neither lies below the 
elementary triangle ABC. Two examples of type (2) can be found in Figure 
2,255. 


Figure 2.2.5: Elementary move of type (1). The edges emanating from AB do 


not coincide with triangle AABC. 
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In the lemma below, we will show that for an elementary triangle of type (2) 
where exactly one edge emanating from AB coincides with the elementary 
triangle, it is possible to subdivide the elementary triangle into a sequence of 
three elementary moves, two of which are type (1) and the other is an R1 
move. This lemma will allow us to concentrate our efforts on elementary 
triangles of type (1) when looking for any remaining necessary additions to 
our list. 


Lemma 2.2.6. Suppose A and B are adjacent defining points of a link L with 
diagram D. Suppose © < R° is a point such that the triangle AABC is an 
elementary triangle of type (2) with exactly one emanating edge coincident 
with AABC. Then the elementary move corresponding to the triangle ABC 
can be performed as a composition of three elementary moves: an R1 move 
and two elementary moves of type (1). 


Figure 2.2.6: Elementary moves of type (2). On the left, one of the edges 
emanating from AB lies under the elementary triangle. On the right, the 
elementary triangle coincides with both edges emanating from AB. 
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Figure 2.2.7: An elementary triangle that coincides with one of the edges 
emanating from AB as well as other portions of the diagram D. 
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Proof of Lemma 2.2.6. Let EA denote the line segment that lies above or 
below the elementary triangle ABC. Select a point A’ along the segment AB so 
that (i) the segment A’C doesn’t intersect any vertices or crossings in the 
diagram D, (ii) the triangle AA’C contains no vertices or crossings from D, 
and (iii) the segment AA’ does not contain a crossing. Such an A’ exists since 
there are only finitely many vertices and crossings in D, but there are 
infinitely many candidates for A’ along the segment AB. Notice that the 
triangles AA’C and A’BC are both elementary triangles since they are 
subtriangles of AABC. 


Next, select a point C’ along the segment AC such that the triangle AA’C" 
coincides with no parts of the link diagram other than the edge AA’ and a 
portion of the edge AE. In particular, select C’ so that the segment AC’ 

contains no crossings. Why does such a point C’ exist? If a point C’ were 


selected and there were some other part of the diagram D that coincided with 
AAA'C", then, by condition (ii) in the selection of A’, the incident portion of 
D would be a straight edge that crosses AAA’C. Where might this edge cross 
A AA'C? It could not cross the segment AA’ by condition (iii) in the selection 
properties of A’. Hence, any straight edge that coincides with AAA'C’ must 
intersect both of the edges AC’ and C’A’. Since A’ was selected to satisfy (i), 
this edge could not cross AE within AAA'C’. Hence the edge must coincide 
with AAA'C’ on the side of AE opposite AA’. In this case, we make a new 
selection for C’ that is close enough to A so that AC’ contains no crossings of 
D. Then the triangle AA’C’ will not coincide with the diagram D except along 
AE. 


To complete the proof of Lemma 2.2.6 we perform the elementary move 
AABC via a sequence of three subelementary moves. (See Figure 2.2.8 for 
reference.) First apply the move that replaces segment AA’ with segments AC’ 
U C’A' (corresponding to AAA'C’), which, by design, is an R1 move. Next 
apply the moves AA'C’'C and AA'CB which, by construction, are both of type 
(1). @ 


Figure 2.2.8: A sequence of three elementary moves that results in the move 
AABC when composed. Starting with the elementary triangle in Figure 2.2.7, 
first apply the R1 move A AA'C’, as seen in (i). Then apply AA'C’'C, as seen in 

(ii), and AA'CB, as seen in (iii). 
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The previous lemma addresses elementary triangles of type (2), where exactly 
one edge emanating from A or B lies above or below AABC. How do we deal 
with type (2) triangles where both emanating edges lie above or below 

AABC? We consider this subcase in the following lemma. 
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Lemma 2.2.7. Suppose A and B are adjacent defining points of a link L and 
C € R8is a point such that the triangle AABC is an elementary triangle. Let D 
be a diagram for L U AABC. Suppose that both edges adjacent to AB lie 
above or below the elementary triangle in D. Then the elementary move 
AABC can be subdivided into a composition of elementary moves: two R1 


moves and other elementary moves of the type in case (1). 
Exercise 2.2.8. Use Lemma 2.2.6 to prove Lemma 2.2.7. 


Our list of localized elementary moves currently contains planar isotopies of 
types (i), (ii), and (iii) and the R1 move, but, as seen in Figure 2.2.1, there are 
still more moves we must add. Thanks to the previous two lemmas we can 
reduce our search from considering all possible elementary triangles to now 
only considering elementary triangles of type (1). This makes our search a bit 
more manageable. 


Using Figure 2.2.5 as a starting point, let’s systematically consider how 
pieces of diagram D may lie above or below AABC. Consider the cases where 


(a) no part of link L lies above or below AABC; 

(b) exactly one line segment lies above or below AABC; 
(c) exactly two line segments lie above or below AABC; or 
(d) more than two line segments lie above or below AABC. 


Clearly these cases exhaust all possibilities. Case (a) describes a planar 
isotopy of type (i), which is an elementary triangle that has already been 
added to our list. 


Exercise 2.2.9. For cases (b) and (c) listed above, draw an exhaustive list of 
representatives (up to planar isotopy) of how the line segments may be 
situated relative to triangle AABC. For example, in case (b) the line segment 
could cross AB or not cross AB. The subcase where it crosses AB has already 
been considered. Explain why. In the subcase where the segment doesn’t 
cross AB there are two cases, as shown in Figure 2.2.9. Case (c) can be 
separated into three subcases: the two line segments do not intersect above or 
below AABC; the two line segments are adjacent and thus AABC contains 
exactly one defining point; or the two segments cross within AABC. Note 
that, up to planar isotopy, we don’t care where a given line segment crosses 
AB, just that it does. 


Exercise 2.2.10. First, determine which of the figures in your solution to 
Exercise 2.2.9 result in an elementary move that is a planar isotopy. These 
moves are already on our list. Next, determine which of your figures are 
merely mirror images of each other. List only one figure from each mirror 
image pair and add it to our list. 


Figure 2.2.9: Examples of case (b). 
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In an effort to keep our list of elementary moves short, we will eliminate a 
few moves. If an elementary move on our list can be completed using other 
elementary moves already on our list, then we need not include it on our list. 
Such moves are called redundant. 


Exercise 2.2.11. Prove that the move in Figure 2.2.10 is redundant. That is, 
show that this move can be completed through a sequence of other moves on 
our list. (Hint: Consider the subdivision in Figure 2.2.10.) 


Figure 2.2.10: This elementary move can be subdivided to show that it is 
redundant. Are any additional subdivisions needed? 
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Exercise 2.2.12. Continuing with your list from Exercise 2.2.10, remove all 
moves that are redundant. Provide a proof that each removed move is actually 
redundant. 


Now we should have a complete list of all necessary R-moves up to mirror 


images. Check your list with Figure 2.2.11. 


Definition 2.2.13. The moves listed in Figure 2.2.11 are called R-moves. 
Link diagrams that are equivalent through a sequence of R-moves and planar 
isotopies are called R-equivalent. 


Figure 2.2.11: Is this list of R-moves complete? 
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Last, we consider case (d), where more than two line segments of L lie above 
or below the type (1) elementary triangle AABC. For this case, we subdivide 
the elementary triangle and show that our existing list of R-moves is 
sufficient. 


Proposition 2.2.14. Suppose A and B are defining points of a link L with 
diagram D and the triangle AABC is an elementary triangle for L such that 
the edges emanating from AB do not lie above or below AABC in diagram D. 
Suppose that more than two line segments of L lie above or below AABC. 
Then AABC can be subdivided into elementary subtriangles such that no 
more than two line segments of L lie above or below each subtriangle. 


Exercise 2.2.15. Prove Proposition 2.2.14 by describing an algorithm that 


results in a subdivision of AABC with the desired properties. (Hint: Start by 
drawing an example triangle and place several segments of L above and 
below the example triangle. Find a process that can be used algorithmically 
to subdivide your example triangle into subtriangles. Then think about how to 
describe and generalize your algorithm so it may be applied to an arbitrary 
AABC. In your proof, make sure to address why your algorithm will always 
work.) 


The results of this section can now be used to prove the main result of the 


next section, that A-equivalence is equivalent to R-equivalence. 


2.3 The Equivalence of A- and R-Equivalence 


Theorem 2.3.1. Let L and L’ be links and D and D’ be diagrams of L and L’, 
respectively. Then L and L' are A-equivalent if and only if D and D' are R- 
equivalent. 


Exercise 2.3.2. Prove Theorem 2.3.1. 


Figure 2.3.1: The three types of Reidemeister moves. 
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Knots and links are usually thought of and drawn as smooth curves. An 
informal way of dealing with this is to view the smooth arcs as polygonal 
curves with a very large number of segments arranged so that the curve 
appears smooth. From here onward, we will draw knots and links as smooth. 
The smoothed versions of the R-moves, shown in Figure 2.3.1, are merely the 
R-moves along with many planar isotopies, enough to make the diagram look 


smooth. We will often refer to these smoothed moves as Reidemeister moves, 
in honor of Kurt Reidemeister, the mathematician that discovered them [31]. 
We consider Reidemeister moves to be smooth R-moves together with 
smooth planar isotopies. If we prefer to be brief, we may also use R-moves to 
refer to the Reidemeister moves. 


Definition 2.3.3. The three Reidemeister moves, often denoted R1, R2, and 
R3, are given pictorially in Figure 2.3.1. We assume that when a 
Reidemeister move is performed on a link diagram, the part of the diagram 
located outside of the dotted circle remains unchanged. 


Figure 2.3.2: Two smooth knots that are equivalent via Reidemeister moves. 
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Exercise 2.3.4. The knots in Figure 2.3.2 are equivalent via Reidemeister 
moves. An equivalence is started in Figure 2.3.3, where each arrow denotes 
the application of a single Reidemeister move. Complete the proof of this 
equivalence. Each diagram in your equivalence should differ from the 
previous diagram by no more than one Reidemeister move and/or a planar 
isotopy. 


Notice in Figure 2.3.2, the equivalence from (b) to (c) denotes an R3 move. 
An R3 move locally transforms a diagram with three crossings and removes 
no crossings. Do you see why the loop in (c) is a required outcome of the R3 
move? 


Figure 2.3.3: A sequence of Reidemeister moves applied to K;. 


Exercise 2.3.5. Consider the knot projection P shown in Figure 2.3.4. 
Surprisingly, for each knot K in Figure 2.3.5, there is some choice of crossing 
information of P that yields K. (Indeed, the projection in Figure 2.3.5 
contains all knots that can be drawn with six or fewer crossings!) So for each 
knot K in Figure 2.3.5, your task is to choose crossing information for the 
precrossings in P to obtain K. Then use Reidemeister moves to show that 
your resolution of crossings in P yields a knot diagram of K equivalent to the 
particular diagram of K given in Figure 2.3.5. 


Figure 2.3.4: A 7-crossing knot projection, P. 


Figure 2.3.5: All knots that can be drawn with 6 or fewer crossings. 
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Exercise 2.3.6. In Figure 2.3.1, we illustrate two types of R1 moves, one R2 
move, and two types of R3 moves, for a total of five oriented moves. If we 
were to draw all possible oriented Reidemeister moves, i.e., R-moves where 
we choose an orientation for each strand involved in the move, how many 
moves would we generate? 


Mathematician Michael Polyak proved that, from our large set of oriented R- 
moves, there is a subset of just four R-moves that generate the entire set [30]. 
What exactly does this mean? Just as any vector in a vector space can be 
written as a linear combination of basis vectors, any oriented R-move can be 
derived from a sequence of R-moves from our generating set. It turns out 
there are many generating sets of four R-moves. We give one particular 
generating set in Figure 2.3.6. 


Figure 2.3.6: A generating set of oriented Reidemeister moves. 
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Exercise 2.3.7. In Figure 2.3.7, we picture an oriented R2 move that is not in 
the generating set shown in Figure 2.3.6. Show how this R2 move can be 
derived from a sequence of moves in the generating set. 


Figure 2.3.7: An oriented R2 move. 


2.4 Nonequivalence and Invariants 


Now that we have a well-defined notion of equivalence for links and link 
diagrams, we can begin to explore the idea of nonequivalence. To show that 
two diagrams represent equivalent knots or links, we need only find a 
sequence of Reidemeister moves that transforms one into the other. But how 
is it possible to show that two diagrams fail to be equivalent? If you think 
about it, you can see that failing to find a Reidemeister sequence relating two 
link diagrams is not proof that the two diagrams represent different links. 
Indeed, there are some complicated diagrams of the unknot that need to be 
made more complicated before they can be simplified. Sometimes a great 
deal of creativity is required to find a Reidemeister sequence relating two 
diagrams. 


The answer to the riddle of how to prove nonequivalence is contained in the 
notion of an invariant. Informally, a link or knot invariant is a function that 
takes in a link or knot and outputs a value. If two equivalent links or knots are 
plugged into this function, the function must return the same or equivalent 
values. Conversely, if two nonequivalent links or knots are plugged in, the 
invariant may return the same or different values. In the case when two links 
are plugged in and two distinct values are returned, we can be sure that the 
two input links are distinct. So link invariants provide tools to prove that 
certain link diagrams are nonequivalent. 


As an analogy, suppose you encounter a strange character (call him Mr. F) 
who you suspect to be your friend Tobias in disguise. To determine if Mr. F 
is indeed Tobias, you might apply the “height invariant.” Height is typically a 
fixed quality in young and middle aged adults. So if you determine that 


Tobias is about 6’1” and Mr. F is 5’7”, you can be sure that Mr. F is not 
Tobias in disguise. 


Now that we have some intuition, let us introduce the formal definition of an 
invariant. 


Definition 2.4.1. A knot or link invariant is a function from the set of 
equivalence classes of knots or links to a specified codomain. 


In Chapter 5 and beyond, we will study knot and link invariants that are 
functions into the set of integers, a set of matrices, or a set of polynomials. To 
prove that a function is indeed a link invariant, we must simply show that it is 
a well-defined function on the set of equivalence classes of links. In other 
words, we must show that if a link K is equivalent to a link J, then the 
function has the same value on K as is does on J. 


Before we develop invariants, however, let’s learn about some interesting 
examples of knots and links! 


Chapter 3 


Families of Links and Braids 


Whenever you learn a new mathematical concept, it is useful to have a 
collection of simple examples to apply the new concept to in order to gain 
intuition. For instance, 2 is a nice example of a vector space, Z is a useful 
example of a group, and the torus is an interesting, yet simple, example of a 
topological space. In this chapter, we introduce a few families of knots and 
links with particularly nice properties. These examples will provide some 
context for us to think about the knot invariants we will introduce in Chapter 
Bi: 


3.1 Twist Knots 


Twist knots are one of the simplest families of knots. They will be an 
interesting family to consider when we look at various combinatorial 
properties of knots. 


Definition 3.1.1. A twist knot of n half-twists, denoted T,,, is obtained by 


twisting two parallel strands n times and then hooking the ends together so 
that the knot is alternating, as seen in Figure 3.1.1. 


Figure 3.1.1: Examples of twist knots. 
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Exercise 3.1.2. Prove that knots (b) and (c) from Table 1.1 are twist knots. 
Are there any other twist knots in Table 1.1? 


Exercise 3.1.3. Prove that every twist knot is invertible. (Recall the definition 
of invertible from Section 1.5.) 


Exercise 3.1.4. Investigate the Knotting-Unknotting game (from Section 1.4) 
on the projection of twist knots T,, where n is even. Prove the following 


propositions. 


1. If Kenya (the knotter) plays second on a twist knot T,,, where n is even, 
then Kenya has a winning strategy. 


2. If Ulysses (the unknotter) plays second on a twist knot T,,, where n is 
even, then Ulysses has a winning strategy. 


Exercise 3.1.5. Investigate the Knotting-Unknotting game on the projection 
of twist knots T,, where n is odd. Who has a winning strategy when Kenya 


(the knotter) plays second? What about when Ulysses (the unknotter) plays 
second? Formulate and prove two propositions about these cases. 


3.2 Pretzel Links 


Another fascinating collection of links is the family of pretzel links. These 
links are both rich in structure but also easy to visualize since they are made 
out of simple twists. 


Definition 3.2.1. Let p, q, and r be integers. A 3-strand pretzel link P,, , 


can be constructed as follows. Take three pairs of string segments and 
arrange them vertically. Twist the bottom ends of the first pair p times (in the 
counterclockwise direction if p > 0 and in the clockwise direction if p < 0). 
Twist the bottom ends of the second pair g times and the bottom end of the 
third pair r times. After twisting the pairs, glue (or tie) more strands of string 
to adjoin the ends of the three pairs as in the examples pictured in Figures 
3.2.1 and 3.2.2. 


Figure 3.2.1: The pretzel link Ps _3 7. 
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Exercise 3.2.2. How many components do the pretzel links in Figures 3.2.1 
and 3.2.2 have? Is either one of these links a knot? 


Figure 3.2.2: The pretzel link Py 5 5, 


Exercise 3.2.3. For which values of p, g, and r is the link P a knot? For 


PQ? 


which p, q, and r does P,,,, have more than one component? Conjecture 


conditions on p, g, and r that determine the number of components in nr 
and then prove your conjecture. 


Exercise 3.2.4. (i) Investigate the invertibility of the knot P75 4 
(ii) Investigate the invertibility of the knot P75 3 


Exercise 3.2.5. (i) Prove that the pretzel knot P;_;_; is equivalent to the 
twist knot T,. 


(ii) In general, are there values of p, q, and r such that P, 4, is equivalent to 
T,? 

Exercise 3.2.6. Investigate if there is a relationship between P,,. and Py 4 
and P. ap: 
is a relationship between P,,. and a pretzel link with a noncyclic 


If you hrid a relationship, explain it. Furthermore, explore if there 


permutation of the original link’s indices, like P, .,,. What did you discover? 


Exercise 3.2.7. Determine a condition on the integers a, b, and c that 
guarantees P, ,, is alternating. 


Exercise 3.2.8. Prove that for positive integers q and r, the link P,_, is an 
alternating link. 


3.3 Torus Links 


Torus links are one of the most accessible, symmetric, and interesting infinite 
families of links. We see several examples of knots and links in this family in 
Figare 3.3.1. 


Figure 3.3.1: Examples of torus links. 


Tg Gooc » Tie 


These links are called ‘torus links’ because each member of this family can 
be embedded on the surface of a torus (think: doughnut) without any 
crossings, as shown in Figure 3.3.2. 


Figure 3.3.2: The torus link T;_5 embedded on the surface of a torus. 


Definition 3.3.1. A torus link diagram T,, , is defined if p > 0 and q, 9 «© Z. 
Begin with p horizontal strands. For q positive, the bottommost q strands are 
—one at a time—wrapped up and over the remaining strands. For q negative, 
the topmost q strands are—one at a time—wrapped down and over the 
remaining strands. Once the wrapping is complete, the strand ends are 


connected so that no new crossings are made. A link L is called a torus link 
if it is equivalent to a torus link diagram. 


Exercise 3.3.2. Draw torus links Tg 9 and Tj _¢. For the four examples given 
in Figures 3.3.1, as well as links Tg 4 and T;9_,¢, determine the number of 
components in each link. Which are knots? 


Exercise 3.3.3. Conjecture a condition on p and q for T,, , which results in a 
one-component torus knot and prove your conjecture. 


Exercise 3.3.4. The families of pretzel links and torus links are not distinct. 
Show that the pretzel knot P_ 33 is a torus knot. Can you find another pretzel 


knot that is a torus knot? 


Exercise 3.3.5. Investigate whether or not the family of twist knots is distinct 
from torus knots. Is there a twist knot that is also a torus knot? If so, give an 
example. If not, explain why not. 


Exercise 3.3.6. Prove that the torus knot T3 , is equivalent to the knot in 
Figure 3.3.3, which is known as the knot 8,9. 


Figure 3.3.3: The knot 819 is a torus knot. 


Exercise 3.3.7. Investigate whether or not the knot 8,9 is an alternating knot. 


Exercise 3.3.8. Investigate whether or not the knot T> 3 is equivalent to T3 5. 
Can your findings be generalized? 


3.4 Closed Braids 


The family of links called closed braids are a generalization of torus links. 
We define closed braids by considering a diagram of n concentric strands. 
For example, see Figure 3.4.1 with 4 concentric strands. A small disc-like 
subregion of the diagram is called a replaceable region if it contains arcs 
from two adjacent concentric strands and nothing else. See Figure 3.4.2 for 
examples and nonexamples of replaceable regions. 


Definition 3.4.1. A closed n-braid diagram is obtained from a diagram of n 
concentric strands by identifying finitely many nonintersecting replaceable 
regions and replacing the two strands within each region by a crossing. A 
knot or link is a closed braid if it is equivalent to a closed n-braid diagram 
for some n. 


Figure 3.4.1: Four concentric strands. 


Figure 3.4.2: Examples of replaceable regions are shown in (a). The diagram 
(b) shows a nonexample of a replaceable region because it contains three 
strands. 


(a) 


Figure 3.4.3: An example of a closed braid resulting from placing crossings 
in the replaceable regions shown in Figure 3.4.2 (a). 
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By construction, a closed n-braid diagram has an innermost region with the 
property that any line segment drawn from the innermost region to a point 
that is exterior to the link will intersect the diagram at exactly n points. We 
count both the under- and over-strands if the segment happens to pass 


through a crossing. In Figure 3.4.4, we see an example that is not in closed 
braid form because it fails the line-segment condition. 


Figure 3.4.4: A diagram that is not in the form of a closed braid diagram. 
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Closed braids can have one or more components. In Figure 3.4.5, the closed 
braid (a) is a link while the closed braid (b) is a knot. 


Figure 3.4.5: (a) A two component closed braid link, and (b) a closed braid 
knot. 
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Exercise 3.4.2. Identify concentric circles and replaceable regions to show 
that the torus link T,, , is a closed braid. 


As we saw in Chapter 2, diagrams that look quite different can be equivalent 
via Reidemeister moves and, thus, represent the same knot. In our current 
setting, this means that a knot is a closed braid if it can be represented in a 
closed braid diagram. Let’s investigate whether certain examples of the knots 
and links we've already studied are closed braids. If a given example does not 
initially look like a closed braid, could they be changed via Reidemeister 
moves into a diagram of a closed braid? 


Exercise 3.4.3. Identify six diagrams in Table 1.1 that, up to planar isotopy, 
are in closed braid form. 


Exercise 3.4.4. Play with the knot in Figure 3.4.6 to discover how to put it 


into closed braid form. (Hint: Make the region with the dot the innermost 
region. To do so, only one strand needs to be moved—the bold strand—but 
the movement will require several Reidemeister moves. Write down a 
sequence of diagrams that show how to put the knot into closed braid form.) 


Figure 3.4.6: Is this knot diagram equivalent to a closed braid diagram? 
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Exercise 3.4.5. Show that knot (n) from Table 1.1 can be written in closed 
braid form. 


Exercises 3.4.6 and 3.4.5 show that even though a knot diagram may seem far 
from being in closed braid form, it is sometimes possible to manipulate it into 
a closed braid form. Is this always possible? Surprisingly, the answer is yes! 
In preparation to prove that all knots are closed braids, we need some new 
notation that uses the sign of a crossing. 


Definition 3.4.6. For a crossing in an oriented link diagram, the sign of the 
crossing is determined by the “right hand rule.’” Imagine placing your thumb 
on the over-strand of the crossing, pointing in the direction of the orientation, 
and curling your fingers under and around in the direction of the under- 


strand. A crossing is called a positive crossing when you must use your right 
hand to curl your fingers in the direction of the under-strand. A crossing is 
called a negative crossing when you must use your left hand to curl your 
fingers in the direction of the under-strand. Figure 3.4.7 shows examples of 
positive and negative crossings. 


Figure 3.4.7: A positive crossing and a negative crossing. 
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We introduce an alternative diagrammatic notation that highlights certain 
features of a closed braid. Here is how it works for diagrams that are already 
in closed braid form. Start by giving the closed braid diagram an orientation 
such that all concentric strands will be traversed clockwise or all will be 
traversed counterclockwise about the innermost region. In the alternative 
notation for our braid diagram we will replace each positive crossing of the 
braid with a line segment labeled with a positive sign, “+,” and replace each 
negative crossing with a line segment labeled with a negative sign, “—”. 
Figure 3.4.8 shows an example of how this is done for a diagram in closed 
braid form. 


Figure 3.4.8: An example of an oriented closed braid and the alternate 
notation for this braid consisting of oriented circles and signed-line segments. 


The alternate notation described above is easy to turn back into a closed 
braid. As seen in Figure 3.4.8, we simply replace each signed-line segment 
with a crossing of the corresponding sign. 


To prove that every knot is equivalent to a closed braid, we develop an 
analogous way to transform an arbitrary knot diagram into a diagram of 
oriented circles and signed-line segments. As is the case for the alternate 
notation applied to a closed braid diagram, we will create oriented circles that 
consistently follow the orientation of the diagram and signed-line segments 
that connect the oriented circles. Thus, given an arbitrary knot diagram, we 
will smooth each crossing so that the local orientation of the arcs are 
consistent with the orientation of the diagram. Then, we replace the crossing 
with an appropriately signed-line segment connecting the two oriented arcs. 
Each newly formed circle in the resulting diagram has a well-defined 
orientation because each local smoothing respected the global orientation of 
the knot. An example of this process applied to a negative crossing is shown 
in Figure 3.4.9 and an example of it applied to a positive crossing is shown in 
Figure 3.4.10. 


Figure 3.4.9: (a) An arbitrary negative crossing, (b) a smoothing of the 
crossing that is consistent with the orientation of the diagram, and (c) a 
replacement of the crossing with a signed-line segment that connects the two 
arcs. 
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Figure 3.4.10: (a) An arbitrary positive crossing, (b) a smoothing of the 
crossing that is consistent with the orientation of the diagram, and (c) a 
replacement of the crossing with a signed-line segment that connects the two 
arcs. 
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Definition 3.4.7. The Seifert diagram of an oriented knot diagram is the 
diagram of oriented circles and signed-line segments that is obtained by 
applying the smoothing and replacement procedure, shown in Figures 3.4.10 
and 3.4.9, to every crossing in the oriented diagram. The orientated circles in 
the Seifert diagram are called Seifert circles. 


The twist knot T, and its Seifert diagram are shown in Figure 3.4.11. 


Figure 3.4.11: The knot T, and its Seifert diagram. 
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Exercise 3.4.8. Construct the Seifert diagrams of the knots T; and P_3 53, 


pictured in Figure 3.4.12. 


Figure 3.4.12: The knots T3 and P_; 5 3. 
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Next, we look at various Seifert diagrams of knots that are not presented in 
closed braid form and look for ways to manipulate them into a closed braid 
form. There is one important caveat. We must take care when manipulating a 
Seifert diagram to ensure that the resulting diagram represents an equivalent 
knot. Our goal is for the Seifert circles within the final diagram to be 
concentric and coherently oriented. 
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Exercise 3.4.9. Figure 3.4.13 represents the Seifert circle representation of a 
knot diagram. This diagram is not a closed braid because the circle labeled C 
fails to be concentric. Show that the diagram can be altered slightly to 
represent a closed braid. Prove that your alteration preserves knot 
equivalence by drawing the knot diagrams before and after the alteration and 
describing the Reidemeister moves that prove these two diagrams are 
equivalent. 


Figure 3.4.13: This diagram is almost in closed braid form. 
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Exercise 3.4.10. Generalize your proof above to the following, more general 
scenario. Suppose a Seifert diagram contains two closed braid subsections 
that are oppositely oriented and connected by a collection of crossings 
between the outermost circles of each subsection (for example, see Figure 
3.4.14). Prove that such a diagram is equivalent to that of a closed braid. 
(Hint: Recall that Reidemeister moves are not defined on Seifert diagrams. 
Thus, an argument proving equivalence must use the knot diagrams, not the 
Seifert diagrams.) Your proof should be a general argument that does not rely 
on Figure 3.4.14 specifically.) 


Figure 3.4.14: This diagram is not in closed braid form. Can it be altered to 
produce an equivalent diagram that is in closed braid form? Investigate! 


There are other types of Seifert diagrams that we have yet to consider, for 
example, the Seifert diagram of the knot T3 from Exercise 3.4.8. The Seifert 


diagram for this knot has four circles and none are concentric. What do we do 
in this case? Let’s simplify this problem and consider a pair of Seifert circles. 
Once we gain some intuition, we can try to generalize our results so that they 
apply to the Seifert diagram T3. There are two cases to consider when looking 


at a pair of non-nested Seifert circles: either the two circles have the same 
orientation or they have opposite orientation (one clockwise and one 
counterclockwise). Our previous argument in Exercise 3.4.10 can be used to 
address the latter case, so let’s focus on the case where the two circles have 
the same orientation, as in Figure 3.4.15. 


Figure 3.4.15: Two Seifert circles, both with clockwise orientation. 


To put the circles in Figure 3.4.15 into a closed braid form, the obvious thing 
to do is to pick up one circle and place it inside the other. Although this idea 
will work in this simple case, it is not generalizable to the case where the two 
Seifert circles are connected to other circles via signed-line segments. So 
instead, let’s use an R2 move on a portion of each circle. Since the circular 
arcs in a Seifert diagram represent the strands of a link, we can perform an R2 
move in a region adjacent to the circles where no signed arcs are located. 
When an R2 move is performed, there will be two new crossings introduced 
that must then be smoothed using the Seifert diagram technique to produce a 
Seifert diagram. 


Exercise 3.4.11. Determine the Seifert diagram associated to Figure 3.4.15 
after an R2 move is applied. 


Exercise 3.4.12. Transform the Seifert diagram of the twist knot T3 into a 


closed braid form. (Hint: Pair up similarly oriented but nonconcentric Seifert 
circles and use the R2 move strategy from Exercise 3.4.11. You may also 


need the techniques from Exercise 3.4.10 to create your final braid diagram.) 


Exercise 3.4.13. Prove that every twist knot can be transformed into a closed 
braid. (You are encouraged to think about this proof as precisely describing 
an algorithm that will transform the Seifert diagram of a twist knot into 
closed braid form. Throughout the algorithm, you should explain not only 
how to apply the algorithm, but why you know that you will be able to apply 
the algorithm and why you know your algorithm will terminate.) 


As we work toward proving that every knot can be represented as a closed 
braid, we introduce new terminology for pairs of Seifert circles and a 
measurement of how far away a Seifert diagram is from closed braid form. A 
pair of circles within a Seifert diagram is called an incoherently oriented 
pair if the circles are nested and have opposite orientation, or if they are not 
nested and have the same orientation. In Figure 3.4.16, we see two examples 
of incoherently oriented pairs. Such a pair of circles would not be seen in a 
Seifert diagram that is in closed braid form. On the other hand, the pairs of 
circles in Figure 3.4.17 are said to be coherently oriented pairs. These pairs 
are either nested circles with the same orientation, or non-nested circles with 
opposite orientation. Notice that any two circles in a closed braid diagram are 
a coherently oriented pair. 


Figure 3.4.16: Two pairs of incoherently oriented circles. 


Figure 3.4.17: Two pairs of coherently oriented circles. 


The following exercise shows that some diagrams of oriented circles 
connected by signed segments cannot be attained as Seifert diagrams of 
oriented knots or links. 


Exercise 3.4.14. Prove that a pair of incoherently oriented circles within a 
Seifert diagram cannot be connected by a signed arc. (Note that there are a 
total of four distinct pairs of incoherently oriented circles, two of which are in 
Figure 3.4.16.) 


Exercise 3.4.15. Use Exercise 3.4.14 to show that the diagram in Figure 
3.4.18 cannot be part of a Seifert diagram, regardless of how orientations are 
assigned to the circles. 


Figure 3.4.18: This diagram cannot be part of a Seifert diagram. 


Next, we define a measurement of how far away a Seifert diagram is from 
closed braid form. 


Definition 3.4.16. The height of a Seifert diagram D, denoted h(D), is the 
number of distinct pairs of incoherently oriented Seifert circles in D. 


Notice that h(D) = 0 if and only if D is (1) in closed braid form or (2) in the 
generalized form of Figure 3.4.14. As we saw in Exercise 3.4.10, the diagram 
in Figure 3.4.14 is just a short manipulation away from being in closed braid 
form. 


Exercise 3.4.17. Show that the Seifert diagram for the knot T, in Figure 
3.4.11 has height 3. 


Exercise 3.4.18. Draw a Seifert diagram of a knot that has height 1 anda 
Seifert diagram of a knot that has height 2. How many such diagrams of 
height 1 or 2 can you find? (Make sure your diagrams are legal Seifert 
diagrams by applying Exercise 3.4.14.) 


Exercise 3.4.19. Determine the height of the Seifert diagram in Figure 3.4.19. 


We know that if a Seifert diagram contains a pair of incoherently oriented 
circles, then it cannot be in closed braid form. The R2 move described in 
Exercise 3.4.11 shows how a pair of non-nested, incoherently oriented circles 
can be transformed into a pair of coherently oriented circles. The next 
exercise shows that whenever such an R2 move can be performed, doing so 
will reduce the height of the Seifert diagram. 


Figure 3.4.19: A Seifert diagram. 


Exercise 3.4.20. Suppose C, and C; are an incoherently oriented pair of 


Seifert circles within a Seifert diagram D that are adjacent to a common 
region of the Seifert diagram. Prove that performing an R2 move on C, and 


C, within the common region results in a Seifert diagram D’ with h(D') < 
h(D). 


We are very close to proving that every knot can be represented as a closed 
braid! There is only one more piece in our puzzle. We need to show that in 
every Seifert diagram with non-zero height, there exists a region adjacent to a 
pair of incoherently oriented Seifert circles in which we can apply an R2 
move. 


Definition 3.4.21. A defect region of a Seifert diagram is a region of a 
Seifert diagram that is adjacent to two incoherently oriented circles. 


An example of a defect region is shown in Figure 3.4.20. The red dotted 
curves in Figure 3.4.21 show two possible places where an R2 move can be 
applied to an incoherently oriented pair of Seifert circles. 


At last, we prove the final step, that if a Seifert diagram contains a pair of 
incoherently oriented circles, then it will also contain a defect region. 


Figure 3.4.20: The shaded region is a defect region of the Seifert diagram. It 
happens to be adjacent to two pairs of incoherently oriented circles. 


Figure 3.4.21: The height-reducing R2 move can be applied along either of 
the red dotted curves shown in these figures. 


Exercise 3.4.22. Let K be an arbitrary knot with Seifert diagram D. If h(D) > 
0, then D contains a defect region. 


We now have all the necessary ingredients to prove what is known as 
Alexander’s Theorem. This result was originally proved by James W. 
Alexander II in 1923 [3]. 


Theorem 3.4.23. [Alexander’s Theorem] Every knot can be represented as 
a closed braid. 


Exercise 3.4.24. Prove Alexander’s Theorem. 


Chapter 4 


Knot Notation 


In Section 1.7, you created your own knot notation that could be used to 
encode the information contained in an alternating knot diagram. Perhaps 
your knot notation was even flexible enough to encode the information in any 
knot diagram. If so, you may have discovered one of the many notations 
commonly used to encode knots by knot theorists today! In this chapter, we 
focus on several common ways to encode knotting information. Some of 
these codes involve strings of symbols, and some will require us to draw new 
types of diagrams. 


4.1 DT Notation 


One of the most common notations in use today is called DT notation, or 
Dowker-Thistlethwaite notation, named after knot theorists Clifford Hugh 
Dowker and Morwen Thistlethwaite. Here’s how it works. 


Starting at a given point on the knot diagram (which we will call the base 
point), travel in the direction of the orientation, labeling the first crossing you 
encounter with a 1, the second crossing you encounter with a 2, and so forth. 
Continue this process until every crossing has two labels and you have 
returned to the base point. Now, travel through your knot diagram a second 
time. For each time you pass over a crossing where you had placed an even 
number, make this even label negative. Even labels that were assigned as you 
passed under a crossing should remain positive. 


For example, see Figure 4.1.1. When we labeled the second crossing, we 
were passing over the crossing, so our label should be —2. On the other hand, 
when we assigned a label to the fourth crossing, we were passing under the 
crossing, so the label should remain a positive number. 


Now that we have our diagram labeled, we create an n-tuple of ordered pairs 
where the first element of each ordered pair is an odd number, and the second 
element is even. The number of elements n in this tuple is equal to the 
number of crossings in our diagram. Here’s how we create this ordered list. 
The first element of the n-tuple should be (1, m) for whichever integer m is 
paired with 1 in the diagram. For the example in Figure 4.1.1, for instance, m 
would equal —6. The second element of the n-tuple should be of the form (3, 
q) where q is the integer paired with 3, etc. 


Figure 4.1.1: A knot diagram with a DT-labeling. 


The 5-tuple we would create from Example 4.1.1 would be 


((1, —6), (3, 10), (5, 8), (7, —2), (9, 4)). 


Of course, when we form such a list, the odd numbers become entirely 
superfluous. For any n-tuple we create in this way, the first elements of the 
ordered pairs will always be the first n odd numbers, in order. So the final DT 
code we derive to represent the knot diagram is simply the ordered n-tuple of 
even numbers in the list, e.g., (6, 10, 8, —2, 4). 


Exercise 4.1.1. Explain why labeling a knot diagram using the DT algorithm 
always gives you one odd and one even number at each crossing. 


Exercise 4.1.2. Reconstruct the knot diagram that has DT code (—6, —8, 12, — 
2, 14, 16, 4, 10). Is the knot diagram alternating? 


Exercise 4.1.3. Find the DT code for knots (b) and (c) from Table 1.1 as 
follows. For both knots, select a base point and an orientation so that the first 
crossing you encounter, you pass over the crossing. What conclusions can 
you make from this exercise? 


Exercise 4.1.4. Can you identify whether or not a knot diagram is alternating 
by looking at a DT code for the diagram? If so, explain how. If not, why not? 


Exercise 4.1.5. Draw your favorite knot diagram and find the diagram’s DT 
code. Have a friend draw their favorite knot diagram and do the same. Now, 
each of you should make a copy of your diagram, without labeling the 
crossings with DT code numbering and without specifying the starting point. 
You may, however, mark the orientation of your knot diagram. Finally, swap 
diagrams without giving away your notation. Find the DT codes for each 
other’s diagrams. Did you come up with the same codes? Why or why not? 


Exercise 4.1.6. Explore the effect on the DT code of moving your base point 
along the knot diagram past a single crossing. 


For the next exercises, it will be helpful to use the following terminology. 


Definition 4.1.7. A knot together with a base point is referred to as a based 
knot. Similarly, a knot diagram with a specified base point is referred to as a 
based-knot diagram. 


Exercise 4.1.8. How does the DT code of an oriented-based-knot diagram D 


compare to the DT code of the same based-knot diagram with the opposite 
orientation? Formulate and prove a conjecture about this relationship. 


Exercise 4.1.9. How does the DT code of an oriented-based-knot diagram D 
compare to the DT code of the mirror image of D? Formulate and prove a 
conjecture about this relationship. 


We have seen that Reidemeister moves don’t affect the knot type, but they do 
affect the knot diagram. Since the DT code is dependent on the specific 
diagram of the knot we are encoding, let’s think about how the DT code 
changes as we perform Reidemeister moves. 


Figure 4.1.2: Equivalent knot diagrams. From left to right, each diagram 
differs from the preceeding diagram by exactly one Reidemeister move. 


Exercise 4.1.10. Determine which Reidemeister moves have been performed 
to obtain each diagram in the sequence in Figure 4.1.2 from the first diagram. 
Then find the DT codes corresponding to these knot diagrams for a consistent 
choice of base point. 


Exercise 4.1.11. Determine the effect on the DT code of performing an R1 
move on a knot diagram. 


Exercise 4.1.12. Determine the effect on the DT code of performing an R2 
move on a knot diagram. 


Exercise 4.1.13. Determine the effect on the DT code of performing an R3 
move on a knot diagram. 


Before moving on to the next way of encoding knotting information, discuss 
with a friend the pros and cons of using DT notation to encode knotting 


information. 


4.2 Gauss Codes & Gauss Diagrams 


So far, we have been considering knots by looking at their knot diagrams in 
the plane. There is another, quite useful visual way of representing a knot 
called a Gauss diagram. A Gauss diagram is made from a circle together 
with several decorated chords. More specifically, each chord is decorated 
with an arrowhead and a sign. Here is an example. 


Figure 4.2.1: A knot diagram and its corresponding Gauss diagram. 


So what exactly is the correspondence between these two types of diagrams? 
Let’s figure it out by focusing on our example. Begin by taking a look at the 
knot diagram in Figure 4.2.1. Find the base point on the diagram, and notice 
that each of the crossings is labeled by a single number. Now, travel from the 
base point around the knot diagram in the direction of the orientation. As you 
travel, record the order in which you encounter the crossings. You might 
record this order in a list, such as: 


You might even record a bit more information by noting when you pass over 
(O) and when you pass under (U) each crossing. For instance, you could 


write the following: 
U1 O02 O03 U4 05 O1 U2 U5 04 UB. 


This sequence is called a Gauss code and can already be used to encode a 
great deal of information about the knot. 


Exercise 4.2.1. Find a Gauss code for the the pretzel knot in Figure 4.2.2. 


Figure 4.2.2: A pretzel-knot diagram. 
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Let’s return to our original example from Figure 4.2.1. To create a Gauss 
diagram from this code, separately draw a circle with a base point on it. 
Travel around the circle counterclockwise from the base point, listing the 
numbers 1, 2, 3, 4, 5, 1, 2, 5, 4, 3 at regular intervals around the circle as you 
go. You should have two 1s, two 2s, etc. Connect each of these pairs of 
numbers by a chord. What you have now is a chord diagram, the first step to 
creating your Gauss diagram! 
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You already recorded in your Gauss code when you passed over and when 
you passed under each crossing. To record this same information in the Gauss 
diagram, decorate the chords in your chord diagram with arrowheads, making 
the arrow point toward the under-strand of the crossing. For instance, you 
first pass under crossing 1 when you travel around the knot from the base 
point. Thus, your arrow should be pointing toward your first encounter with 
chord 1 in the Gauss diagram. Add arrowheads to all chords in your chord 
diagram and then compare it with Figure 4.2.1 to see if you're on the right 
track. 


Finally, determine the sign of each crossing, as in Definition 3.4.6, and 
decorate each arrow of your Gauss diagram with its sign. Voila! You are 
done. Your Gauss diagram should look the same as the one in Figure 4.2.1. 


Exercise 4.2.2. Create a Gauss diagram for the pretzel knot in Figure 4.2.2. 


Exercise 4.2.3. Create Gauss diagrams for all of the knot diagrams in Figure 
1.1. (You may ignore any links with more than one component!) 


Exercise 4.2.4. Suppose you are given a Gauss diagram that is associated to a 
knot diagram, but you aren’t sure what the knot diagram looks like. Can you 
tell by looking at the Gauss diagram whether or not the corresponding knot 
diagram is alternating? 


You may ask yourself a natural question at this point. Do different knot 
diagrams of the same knot correspond to different Gauss diagrams? To 
answer this question, let us consider the effect on the corresponding Gauss 
diagrams of performing a Reidemeister move on a knot diagram. 


Exercise 4.2.5. Find the Gauss diagrams corresponding to the four equivalent 
knot diagrams in Figure 4.1.2. What can you conclude about the effect of 
Reidemeister moves in a Gauss diagram? 


Using the previous exercise for intuition, let’s think more systematically 
about the effects different R-moves have on Gauss diagrams of knots. 


Exercise 4.2.6. Determine all possible effects on the corresponding Gauss 
diagram of performing an R1 move on a knot diagram. 


Exercise 4.2.7. Show that if a Gauss diagram contains only nonintersecting 
arrows, as in Figure 4.2.3, then it must be a Gauss diagram of the unknot. 


Exercise 4.2.8. In Figure 4.2.4, we see a Gauss diagram schema that 
illustrates one possible effect of an R2 move on a Gauss diagram. (The dotted 
portions of the circle in the Gauss diagram indicate that any additional chord 
endpoints may appear in this portion of the diagram. The solid portion of the 
circle in the Gauss diagram indicates that no additional chord endpoints 
appear in this portion of the diagram.) Are there other Gauss diagrammatic 


R2 moves? Determine all possible effects on the corresponding Gauss 
diagram of performing an R2 move on a knot diagram. Share your findings 
by providing a complete list of one or more Gauss diagrammatic schemas, 
including the one shown in Figure 4.2.4. 


Figure 4.2.3: A Gauss diagram containing only nonintersecting arrows. 
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Figure 4.2.4: A Gauss diagrammatic Reidemeister 2 move. 
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Exercise 4.2.9. Determine all possible effects on the corresponding Gauss 
diagram of performing an R3 move on a knot diagram. Record your findings 


by providing Gauss diagrammatic schemas, as in Figure 4.2.4, for each R3 
move. 


Exercise 4.2.10. Compare the results you obtained in the previous exercises 
to the Gauss diagrammatic R-moves illustrated in Figure 4.2.5. Did you 
discover any possibilities that are missing in Figure 4.2.5? 


As we learned in Section 2.3, all oriented R-moves can be derived from the 
generating set of moves in Figure 2.3.6. So while you may have found 
several related valid Gauss diagrammatic R-moves, the moves in Figure 4.2.5 
will suffice to generate all possible equivalences of Gauss diagrams. 


Figure 4.2.5: Gauss diagrammatic Reidemeister moves. The n is a variable 
that may represent + or -. 


Figure 4.2.6: A Gauss diagrammatic R3 move that does not appear in Figure 
4.2.5. 


Exercise 4.2.11. Notice that the Gauss diagrammatic R2 move shown in 
Figure 4.2.4 and the R3 move shown in Figure 4.2.6 are not pictured in 
Figure 4.2.5. Prove that the two moves shown in Figures 4.2.4 and 4.2.6 can 
be obtained via a sequence of Gauss diagrammatic R-moves that are all in 
Figure 4.2.5. Your proofs must respect the dotted regions of the circle in the 
Gauss diagram, meaning that your argument cannot assume no other chords 
are within the dotted regions. 


Exercise 4.2.12. You just learned how to make Gauss codes and Gauss 
diagrams for knots, but what about links with more than one component? 
Invent an extension of Gauss notation for two-component links. 


4.3 Rational Tangles and Knots, and Conway Notation 


Another type of knot notation, called Conway notation, is rather different 
than the notations we've discussed so far. Conway notation can be more 
difficult to determine for an arbitrary diagram, but it is incredibly useful for 
defining and studying an interesting family of knots called rational knots. 
We will narrow our focus here and use Conway notation to describe only 
rational knots. Readers that are interested in exploring Conway notation for 
nonrational knots can find more information here [9]. 


The building blocks of rational knots, called tangles, are special types of 
subdiagrams of a knot or link diagram. 


Definition 4.3.1. A tangle diagram is a diagram contained in a disk that 
consists of two arcs whose four endpoints are fixed along the boundary of the 
disk. The disk is denoted by a dashed circle surrounding the subdiagram. 
Sometimes the dashed disk boundary is omitted, but implied. When drawing 
a tangle, we situate the four fixed endpoints roughly on the NW, NE, SE, and 
SW compass points. 


Figure 4.3.1: Examples of tangle diagrams. 
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We can view a tangle diagram as a subdiagram of a knot or link by imagining 
that the two arcs in the tangle extend into a larger knot or link diagram 
outside of the disk. 


Definition 4.3.2. Two tangle diagrams are equivalent provided that one can 
be continuously deformed into the other via planar isotopies and 
Reidemeister moves performed within the dashed circle while keeping the arc 
endpoints fixed. The symbol ~ denotes tangle equivalence. 


Exercise 4.3.3. To practice working with tangle equivalence, find two pairs 
of equivalent tangles in Figure 4.3.2. Remember, when manipulating a tangle, 
the four endpoints must remain fixed. 


Figure 4.3.2: Which pairs of tangles are equivalent? 
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Conway’s notation for rational knots elucidates the rich algebraic and 
geometric structures that stem from combining and manipulating tangles. 
Tangles can be summed, multiplied, rotated, mirrored, inverted, and flipped. 


et cai 


Definition 4.3.4. Let T, T,, T>, and Q be tangles. The tangle sum of T, and 
T,, denoted T, + T> and depicted in Figure 4.3.3, is the result of horizontally 
arranging T; and T, and connecting their adjacent arc endpoints. The tangle 
product of T, and T,, denoted T, * T, and depicted in Figure 4.3.4, is the 
result of vertically stacking T, and T, and connecting adjacent arc endpoints. 
The rotation of a tangle T, denoted 7", rotates T clockwise by 90 degrees. 
The mirror of a tangle T, denoted — T’, is the tangle resulting from switching 
all crossings in T. The inverse of a tangle T, denoted T’, is defined to be — T’, 
the mirror of the clockwise 90-degree rotation of T. The last two operations 
on a tangle Q are the vertical and horizontal flips of Q over a vertical or 
horizontal line, denoted by QV’? and Qf", respectively, and depicted in 
Figure 4.3.6. 


Figure 4.3.3: The sum of two tangles, T, + T>. 
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Figure 4.3.4: The product of two tangles, T, * Ty. 


Figure 4.3.5: The rotation and inverse of a tangle. 


The tangle building blocks that are used to create all rational tangles are the 
integer and reciprocal tangles, examples of which are shown in Figures 
4.3.7 and 4.3.8. 


Definition 4.3.5. We define a rational tangle as a tangle that can be 
constructed inductively, as follows. All integer and reciprocal tangles are 
rational tangles. Given a rational tangle T,, the tangle T;, that is obtained 


from T, by either (1) adding an integer tangle or (2) multiplying by a 
reciprocal tangle is also a rational tangle. A diagram of a rational tangle, T, 
that depicts the finite sequence in the inductive construction of T is called a 
twist diagram of the rational tangle T. 


Figure 4.3.6: The vertical and horizontal flip of a tangle. 


Figure 4.3.7: Examples of integer tangles. 
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Figure 4.3.8: Examples of reciprocal tangles. 
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Figure 4.3.9: Some twist diagrams of rational tangles. 
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When first looking at a twist diagram, it can be difficult to determine the 


sequence of integer and reciprocal tangles used to make the rational tangle. 
However, if you are given a twist diagram, you will be able to identify the 
last summand or factor because two adjacent arc endpoints (from that last 
summand or factor) can be untwisted to remove it from the tangle. In other 
words, from T;.,,, we can always find two adjacent arc endpoints in the tangle 


that can be untwisted (some number of times) to uncover T;,. 


The second to last summand or factor can be identified similarly, and so on, 
until the starting point of the rational tangle is apparent. For example, we can 
find the construction sequence for rational tangle (a) in Figure 4.3.9 by 
observing it has a last summand of [—1] on the right of the diagram. 
Untwisting this rightmost [-1] summand, we see the second to last term was 
multiplication below by the reciprocal tangle —z. Untwisting the factor +5, 
we see a left summand of [2] and a right summand of [-4]. Untwisting both 
integer tangles [2] and [-4], we see that the rational tangle (a) was 
constructed starting with the reciprocal tangle 7. Finally, we arrive at the 
following notation for the rational tangle (a) in Figure 4.3.9: 


Exercise 4.3.6. Write the rational tangle (b) in Figure 4.3.9 as a sum and 

product of integer and reciprocal tangles by using the untwisting strategy 
described above. Be careful to use parentheses appropriately in your final 
notation! 


In the next exercise, we will show that the notation used for describing a twist 
diagram is not unique. For example, the rational tangle (a) in Figure 4.3.9 can 
also be described as: 


((el+ g++ sl) * op) + eu 


or, by an equivalence, as 


Exercise 4.3.7. Prove the following three statements. 


1. For all integers n and m, [n] + [m] ~ [n + m]. 


2. For all integers n and m, + * ¢y ~ poy. 


3. For any nonzero integer n, {n'' = + and (4) = in}. Also, [0]! = [co] and 


[0]! = [0]. 


Now that we are familiar with the construction of rational tangles, a 
reasonable question to ask is why do we restrict to summing integer tangles 
and multiplying by reciprocal tangles? Can’t we add two reciprocal tangles? 
Or multiply by an integer tangle? Well, in truth, both of these things can be 
done; however, the result is generally not a rational tangle. One of the key 
properties of a rational tangle is that, if we allow the endpoints of the tangle 
to move, it can be completely untwisted until it becomes the [0] or [o>] tangle. 
In Figure 4.3.10 there are some tangles that fail to have this property when 
we use the sum operation with reciprocal tangles or multiplication with 
integer tangles. 


Exercise 4.3.8. Identify the tangle sums from Figure 4.3.10 that are not 
rational. 


Figure 4.3.10: Four examples of tangle sums and products, not all of which 
are rational. 
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Exercise 4.3.9. Identify the tangle in Figure 4.3.1 that is not a rational tangle. 


Exercise 4.3.10. For each tangle, determine whether or not it is rational. 
@ (81+ ((a*)) «5)) +r 
G3) (+ (91+ + 8)) +e) +h 


With the next exercise and definition, we set the stage to uncover some 
surprising results about the vertical and horizontal flip of a rational tangle. 


Exercise 4.3.11. Prove that integer and reciprocal tangles are equivalent to 
their flips. That is, prove the following four equivalences. 


1. ( n|)V fhe ~ n| 


2.1 in} ) #4 ~ [n] 
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Definition 4.3.12. Given a tangle of the form [+1] + P, the horizontal flip of 
P that untwists [+1] and yields the eqitvarent tangle P# flip + [+1] is called a 
flype. ania: for a tangle of the form => * P. , the vertical flip of P that 
untwists =z and yields the equivalent ale PVI4NP » Ey is also called a flype. 
Two of these four flype equivalences are depicted in Figure 4.3.11. 


Figure 4.3.11: The flype equivalences. 
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In Exercise 4.3.11, we saw that the flips of integer and reciprocal tangles are 
equivalent to themselves. Perhaps surprisingly, all rational tangles are 
equivalent to their flips! Also, another interesting result follows as a corollary 
of the flip equivalence, namely that tangle addition and multiplication are, in 
a sense, commutative. 


Theorem 4.3.13. [The Flip Theorem] For a rational tangle P, 
Pw~PVYitp and Pw PHF, 


Corollary 4.3.14. For rational tangle P and an integer m, 


1 1 
P+[mj)~[m]j+P and P«x—~—xP 
[m]  [m 


The next exercise and the example in Figure 4.3.12 help us build intuition 
that will lead to the proof of The Flip Theorem and its corollary. 


Exercise 4.3.15. Use the flype equivalences to prove that the following 
statements hold for all integers n and m. 


(i) + py ~ py tl, and 
(ii) [el * fay ~ tg * [r 


Figure 4.3.12: An example showing P# flip ~ P, for a rational tangle, P. 
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As seen in Figure 4.3.12, for the rational tangle P iy « ( i 3), its 


horizontal flip is given by P”/"? = (ea 3}) * ty Next we use the flype 
equivalences of type (a) and (b) from Figure 4.3.11. First, we apply a flype of 
type (b) to P#/"?, moving the factor +; from the bottom to the top of the 
figure. Then, three flypes of type (a) transform |3| + yz to -y + (3), and this 


completes the diagrammatic argument showing P# flip ~ p, 


Exercise 4.3.16. Using the techniques shown in Figure 4.3.12, write a 
sequence of diagram equivalences that show Q ~ QV’? for 


Q=(47* (3 + m)) +[-2). 


Next we will prove The Flip Theorem using the intuition we have gained 
from Figure 4.3.12 and Exercise 4.3.16. 


Exercise 4.3.17. Prove The Flip Theorem. (Hint: Use induction on the 
number of crossings in P. For the inductive step there will be four cases to 
consider: P = Q + [+1], P = [41] + Q, P = Q* ay, and P = sy * Q. Notice that 
Q is a rational tangle with one fewer crossing than P.) 


Exercise 4.3.18. Use The Flip Theorem to prove Corollary 4.3.14. 


The next equivalence is simple, but eye opening! Recall from part 3 of 
Exercise 4.3.7, we recognized that a reciprocal tangle 7 is the inverse of the 
integer tangle [a], and vice versa. In Theorem 4.3.19, we recognize the 
product ? « + as the inverse of a sum. This equivalence is a key step in 
understanding the term rational is used in the name of rational tangles. From 
here onward, we use the notation + instead of P'. 


Theorem 4.3.19 (The Product-to-Inverse Equivalence). For a rational tangle 
P and an integer a, 
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Exercise 4.3.20. Prove Theroem 4.3.19. (Hint: Draw the diagrams within the 
claimed equivalence and provide a Reidemeister sequence of diagrams that 
prove they are equivalent tangles.) 


The Product-to-Inverse Equivalence will allow us to transform our notation 
for a rational tangle in a very important way; given the notation for a rational- 
tangle-twist form we can change it into notation that looks like a continued 
fraction. This change is accomplished by replacing each product expression 


with the equivalent inverse expression. As our notation for a rational tangle 
becomes more like a rational number, we will get closer to understanding a 
powerful association between rational tangles and rational numbers. 


First, let’s use an example to clarify the notational-transformation process of 
replacing product expressions with the equivalent inverse expressions. In this 
example, we use Corollary 4.3.14 and Theorem 4.3.19 repeatedly, and the 
results from Exercise 4.3.7. 


Let’s start with the notation for a particular rational tangle in twist form, 


(5) + (a ‘ (13 n = +l « nia) " -8)) | 


Using the Product- - Inverse Equivaleice we can transform 7; * 7’ into the 
equivalent tangle 7.= iF -, where 7’ = {3] + (a * [6] * 0) +|-8). Hence, 


] ] 
Bl + (Fy * (81+ ( * U6) * 7) + -8))] 
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Next, from commutativity of adding [—8] and the commutativity of 
multiplying by =, and from using that [3] + [-8] ~ [-5] and 4 * ny ~ pa, we 
have the following two equivalences. 


Using the Product-to-Inverse Equivalence again, this time on the product 


a 
6) * —~, we have 


[12] 


Now, our rational tangle that was once in twist form looks rather like a 
continued fraction! 


Exercise 4.3.21. (a) Use Corollary 4.3.14 and the Product-to-Inverse 
Equivalences to transform the rational tangle below into an expression that 
looks like a continued fraction. 


yt (i+ (gye(2l+ y+ to» Ha) 


(b) Compare your continued fraction from part (a) to the preceding example. 
The notation should suggest a relationship between the rational tangles 


6+ (qe (i+ (gel pq) +L8)) a 
5 : (i + (a ‘ (1 + a " 0) , =) | 


Using this example for inspiration, conjecture and prove a more general 
tangle relationship. 


Given a rational tangle in twist form, the process from Exercise 4.3.21 and 
the preceding example can be applied to write the tangle in what is called 
continued fraction form. 


Definition 4.3.22. A continued fraction form of a rational tangle is an 
algebraic description of a rational tangle by a continued fraction of integer 
tangles as shown below 


a + 
Fell poe 
[a2|+ Tal 
where each q, is a nonzero integer, except possibly a,, which could be zero. 
Notice that the integer tangle [a,,] is the last tangle in the construction of T. 
The twists in [a,] are the portion of the rational tangle where untwisting 


could start if one were to deconstruct the rational tangle. On the other hand, 
the tangle [a,] denotes the tangle to which all other integer and reciprocal 
tangles must be summed and multiplied; that is, [a,] is the first tangle in the 
construction of T. When drawing a rational tangle T using its continued 
fraction notation, it is helpful to notice whether [a,] is an integer or reciprocal 
tangle when T is given in twist form. Since the integer tangles in continued 
fraction notation alternate between reciprocal and integer tangles, we can 
simply use the parity of n (the number of integer tangles in the continued 
fraction form of T) to determine whether [a] is an integer or reciprocal 
tangle in the twist notation for T. 


We use these observations in the following exercise. 


Exercise 4.3.23. Draw the rational tangle diagram from its continued fraction 
form. 


The continued fraction form of a rational tangle motivates the following 
association between continued fractions and rational tangles that was first 
seen to be a complete invariant for rational tangles by John H. Conway in 
1970. 


Definition 4.3.24. For a rational tangle T, with continued fraction form 
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[an] + 
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we defined the fraction of T, denoted F(T), by 
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Theorem 4.3.25. [Conway’s Theorem] Two rational tangles are equivalent 
if and only if their associated fractions are equal. 


Conway’s Theorem implies that the fraction defined above is a complete 
invariant of rational tangles. That is, it tells us exactly when two tangles are 
and are not equivalent. As an example, consider the two continued fractions 
below. 


Exercise 4.3.26. Show the two fractions below are equal. 
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The two fractions in Exercise 4.3.26 are the fractions associated to the 
rational tangles, 


. 1 ; 1 
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[-2] + [2] + By 

Conway’s Theorem implies that, since the two numerical fractions are equal, 
the two rational tangles are equivalent. Figure 4.3.14 shows this through a 


sequence of equivalences (planar isotopy, Reidemeister moves, and flypes). 


Exercise 4.3.27. For each equivalence in Figure 4.3.14, identify whether it is 
a planar isotopy, a flype, or sequence of several Reidemeister moves. 


Exercise 4.3.28. Use Conway’s Theorem to determine whether or not the two 
rational tangles shown in Figure 4.3.13 are equivalent. 
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Exercise 4.3.29. Use Conway’s Theorem to determine whether or not the two 
rational tangles shown in Figure 4.3.15 are equivalent. 


Figure 4.3.14: A sequence of tangle equivalences showing 


((4)* 45) +031 ~ (i3)* 4) +02. 


Figure 4.3.15: Are these rational tangles equivalent? 
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In what follows, our aim is to prove half of Conway’s theorem, namely that if 
two rational tangles have equal fractions, then the rational tangles are 
equivalent. We direct the interested reader to several excellent sources that 


also prove the converse, that the fraction of a rational tangle is a well-defined 
function on equivalence classes of rational tangles [12,23]. 


Theorem 4.3.30. If two rational tangles have equal fractions, then the 
tangles are equivalent. 


To prove that equal fractions imply equivalent tangles, we will use three 
ingredients: a unique representation of a continued fraction, an algebraic 
equality due to Lagrange, and a tangle equivalence that we will call 
“Lagrange’s equivalence.” 


We begin with a fact about the unique representation of continued fractions. 
Every fraction, 5, has a unique representation as 
p 1 such that all integers aj;, 
—-=aA,+ 
G 
d An-1 + 


1 for i <n, are positive, 


z , > ) a . Tra ( + 7, YY 
1 dp, could be negative or zero 
Qn—2 + ee + — i 

ay and a, is greater than 1. 


Such a continued fraction, written with all nonnegative integers except 
possibly a,,, will be called the regular form continued fraction of 5. Notice 
that within a regular form continued fraction, the expression a a is 
always positive and less than 1. This means that a, must be the greatest 
integer less than or equal to 5. Hence, given a fraction = the value of a, in its 


regular continued fraction form is unique. The following exercise should help 
make the existence and uniqueness of the remaining integers a,j, d,9,---; 


a, seem plausible as well. 


Exercise 4.3.31. For each of the three continued fractions below, show that it 
is equal to a regular form continued fraction. (Hint: First write the fraction as 
©, then determine the appropriate value for a,. Next, determine a,,_, > 0, a,» 


> 0, and so on. Reflect on your algorithmic process to construct an argument 
for why the resulting regular continued fraction representation of is unique.) 


Next a useful formula, that is attributed to Lagrange, is given by 


1 ] 
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Exercise 4.3.32. Use algebra to prove that the formula of Lagrange, stated 


above, is a valid formula. 


This formula, used first by Goldman and Kauffman in the proof of Conway’s 
theorem [12], can be used to transform a continued fraction that is not in 
regular form into one that is in regular form. Observe that the negative term 

-; on the left of the equation is transformed into an expression including the 
nonnegative term +; on the right side. Thus, the right side of the equality 
has one fewer negative sign. A partial example that illustrates this 
transformation is shown below. 


Consider the continued fraction (a) from Exercise 4.3.31, which is not in 
regular form. To partially transform this expression into its regular form, we 
first factor out the negative signs so that they become subtraction operations 
in the continued fraction expression. This is needed for the application of 
Lagrange’s formula, as the left side of the equation involves a difference with 
the expression + rather than the addition of =. 


Notice that, after factoring out the two negative signs in front of the 3 and 4, 
we have a total of three subtractions. This means that we need only to apply 
Lagrange’s formula at most three times. We begin with the subtraction at the 
bottom of the continued fraction. 
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Exercise 4.3.33. Use Lagrange’s formula twice more on the previous 
example to find the regular form of this continued fraction. Note that your 
final expression should be the same continued fraction as was found in 
Exercise 4.3.31. 


Exercise 4.3.34. Use the Lagrange formula to transform the continued 
fractions (b) and (c) from Exercise 4.3.31 into regular form. (Hint: Factor all 
negative signs out of fraction denominators to become subtraction signs. 
Then apply Lagrange’s formula multiple times from the bottom upward.) 


Last, we state and then prove a tangle equivalence inspired by Lagrange’s 
equality; not only does Lagrange’s formula hold for continued fractions, but 
it also holds for rational tangles! 


Theorem 4.3.35 (Lagrange’s Equivalence for Rational Tangles). For an 


integer tangle [P| and a rational tangle [Q], 


P\- a ~[P-1+ 


[Q) [Z| + OR 


Q)+[-1] 

Exercise 4.3.36. Prove Lagrange’ s Equivalence for Rational Tangles. (Hint: 
Draw the diagram for |” ~ |. + ro so by first drawing [Q] + [-1], then 
invert it, and so on. Next, draw the diagram for |?| - lal yj. Find a short sequence 
of equivalences (using planar isotopies, flypes, flips, or a sequence of 
Reidemeister moves) that transform the more complicated diagram into the 
simpler. State the name of each step of your equivalence as a planar isotopy, 
flype, flip, or sequence of Reidemeister moves. For inspiration, you may wish 
to look at the equivalences in Figure 4.3.14.) 


The last step in the proof of Theorem 4.3.30 is to put these ideas together. We 
Start with the assumption that T and R are two rational tangles whose 
associated fractions both equal 5. Denote the continued fraction form of T and 
R by 
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Figure 4.3.16: The hypothesis of Theorem 4.3.30. Suppose T and R are 
rational tangles such that F(T) =~ = F(R). 
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Exercise 4.3.37. Write a formal proof of Theorem 4.3.30. (Hint: Referring to 
Figure 4.3.16, what happens when Lagrange’s equality is repeatedly applied 
to the continued fraction F(T) with the goal of transforming it into a regular 
form continued fraction? What happens when Lagrange’s equality is 
repeatedly applied to the continued fraction F(R) with the goal of 
transforming it into a regular form continued fraction? How does this relate to 
52 To prove T ~ R, it suffices to prove T ~ Q and Q ~ R for some rational 
tangle Q. Figure 4.3.17 should inspire both the selection of Q and the means 
of proving that both T and R are equivalent to Q.) 


Figure 4.3.17: The desired conclusion of Theorem 4.3.30 is to show that T ~ 
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Now we will make some connections back to the world of knots and links by 
defining rational knots and links. There are two natural ways to connect the 
strand-ends of a tangle to create a knot or link. They are called the 
numerator and denominator closures of T, denoted by N(T) and D(T), 


respectively, and depicted in Figure 4.3.18. 


Figure 4.3.18: The numerator and denominator closures of a tangle, T. 


N(T) = D(T) = 


Definition 4.3.38. A knot or link, L, is called rational provided that there 
exists a rational tangle T such that either N(T) or D(T) is equivalent to L. 


Exercise 4.3.39. Prove or disprove each of the following statements. 
1. Fora tangle, T, N(T) = D(T’). 
2. Fora tangle, T, D(T) = N(1"). 
3. For a rational tangle T, N(T* yy) = N(Y). 
4. For a rational tangle T, D(T + [a]) = D(1). 


Parts 3 and 4 of Exercise 4.3.39 imply that if K is a rational link such that K = 
N(T) for a rational tangle T, then we can assume that the last tangle 
algebraically contributing to T is an integer tangle. Similarly, if K is a rational 
link such that K = D(T) for some rational tangle T, then we may assume that 
the last tangle algebraically contributing to T is a reciprocal tangle. 


As we mentioned at the beginning of this section, Conway notation is a 
useful notation for encoding rational links and knots. In fact, Conway used 
his notation (for rational and nonrational knots) to create a list (by hand) of 
all prime knots with 11 or fewer crossings. 


Definition 4.3.40. Suppose the knot K is equivalent to the numerator closure 
of a rational tangle, T, with continued fraction form 


T = |an| + 
| ! [an—1] +---+ 
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such that each a; is a nonzero integer. The Conway notation for K is given 
by the string of integers [[a, a>... a,]]. 


For example, the Conway notation [[4 —2 3 1 5]] encodes a knot whose final 
summand (before numerator closure) is the integer tangle [5]. Since the 
numbers listed alternate between integer and reciprocal tangles, the 3 and 4 
values in the notation also represent integer tangles while the —2 and 1 
represent reciprocal tangles. Hence the notation [[4 —2 3 1 5]] encodes the 


rational knot 
K=N (((( x a) + 3) x a) i) 


Similarly, to decode the notation [[3 6 —4 —3]], we observe that the final 
summand of the rational tangle is the integer tangle [—3]. Hence, within the 
notation [[3 6 —4 —3]], the 3 and —4 represent reciprocal tangles and 6 and —3 
represent integer tangles. So [[3 6 —4 —3]] encodes the rational link 


Ls (q+ i) ‘oa) +[-3]). 


Exercise 4.3.41. The Conway notation for all nontrivial prime knots with 
seven crossings is given below. Draw the knots that correspond to each 
sequence. 


As a corollary of Conway’s Theorem we have the following. 


Corollary 4.3.42 (Conway). Let L, and L, be rational links. Then L, and L» 
are equivalent if their continued fractions are equal. 


Exercise 4.3.43. Prove Corollary 4.3.42 using Theorem 4.3.25. 


Note the difference between Conway’s Theorem and its corollary. Not only is 
every instance of “tangle” replaced by “link,” but there is another difference 
as well. Can you spot it? The corollary implies that there might be equivalent 
rational links that have different associated continued fractions. Perhaps this 
makes sense, because once we've closed a tangle, we no longer have any 
endpoints we need to keep fixed. We have a bit more freedom in how we 
might manipulate our link diagram. While there is indeed more freedom, we 
state an incredibly useful result, proven by Schubert, that completely 
characterizes rational link equivalence [36]. 


Theorem 4.3.44. [Schubert’s Theorem] Suppose L, is a rational link with 
reduced fraction = and Ly is a rational link with reduced fraction ®.. Then 
L,and L, are equivalent if and only if 


1. p=p' and 
2. either q = q' (mod p) or qqd' = 1 (mod p). 


Exercise 4.3.45. Show that the rational links associated to the sequences [[2 3 
4] and [[4 3 2]] are equivalent but that the rational tangles associated to these 
sequences are not equivalent. 


Exercise 4.3.46. Show that for any nonzero integers a, b, and c, the rational 
link [[a b c]] is equivalent to the rational link [[c b a]]. (Challenge: Can this 
result be generalized to rational “palindromic” sequences with more than 
three terms, i.e., links of the form [[q, a ... a,]] and [[a,... a9a,]]?) 


Chapter 5 


Combinatorial Knot Invariants 


We discussed the idea of a knot or link invariant in Section 2.4 as a means of 
proving that two links are not equivalent. In Chapter 5, we will consider some 
examples as well as a nonexample of knot and link invariants. 


5.1 The Writhe of a Diagram 


We begin our exploration by recalling the notion of the sign of a crossing 
from Definition 3.4.6, pictured again in Figure 5.1.1 below. 


Figure 5.1.1: Negative and positive crossings. 
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Definition 5.1.1. The writhe w(D) of a link diagram D is the sum of the signs 
of the crossings in the diagram. 


Exercise 5.1.2. Choose an orientation for each of the knot diagrams in Figure 
5.1.2, and then compute the writhe. Once you've done this, change all 
orientations and compute the writhes of the oriented knot diagrams with 
reverse orientations. 


Figure 5.1.2: Determine the writhe of these knots. 
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Exercise 5.1.3. Draw a picture of your favorite knot. Give it an orientation 
and compute the writhe. Now, give your diagram the opposite orientation. 
Compute the writhe of this diagram. What do you notice? Formulate and 
prove a conjecture about the relationship between the writhe of an oriented 
knot diagram D and its reverse). 


Exercise 5.1.4. 1. Draw a picture of your favorite 2-component link. Give it 
an orientation and compute the writhe. 


2. Now, give one of the components of your diagram the opposite 
orientation. Compute the writhe of this diagram. What do you notice? 


3. Now, change the orientation of the second component and compute the 
writhe of the resulting diagram. What do you notice? 


4. Formulate and prove a conjecture about the relationship between the 
writhe of an oriented link diagram D and the writhe of link diagrams 
obtained from D by reversing the orientation of (a) one component or 
(b) both components of the diagram. 


Exercise 5.1.5. Consider the torus knot T; ,, where n is a positive odd 
number. What is the writhe of T; ,,? 


Exercise 5.1.6. Consider the standard rational link diagram D with Conway 
notation [[a b c]]. What is the writhe of D ? 


Let’s investigate whether or not the writhe is a link invariant. Recall from 
Chapter 2 that two diagrams represent the same link if and only if the two 
diagrams are equivalent by a sequence of Reidemeister moves and planar 
isotopies. Therefore, if we can prove that the application of the Reidemeister 
moves doesn’t change the writhe, then we will have shown that the writhe is 
an invariant. We'll start our investigation with an analysis of the writhe for a 


Reidemeister 2 move. 


Suppose that a Reidemeister 2 move or its inverse is applied to a link 
diagram. We will show that the local portion of the link diagram contributes 
the same value to the writhe before and after the move is performed. 


Consider Figure 5.1.3 and observe that the two crossings are signed +1 and — 
1, regardless of how the arcs are oriented. Thus these two crossings 
contribute a zero sum to the writhe of the diagram. After the Reidemeister 2 
move is applied, both crossings are removed. Thus, locally, this portion of the 
diagram still contributes zero to the writhe. Therefore the writhe is invariant 
under a Reidemeister 2 move. 


Exercise 5.1.7. Finish the investigation into whether or not the writhe of an 
oriented link diagram is an invariant by considering the following questions. 


Figure 5.1.3: Before and after a Reidemeister 2 move. 


\ J a —__ 
<a a. 


1. Is the writhe invariant under the Reidemeister 3 move? (To answer this, 
consider the sums of the signs of the crossings in a link diagram just 
before and just after a Reidemeister 3 move has been performed.) 


2. Is the writhe invariant under the Reidemeister 1 move? 


3. What can you conclude? Is the writhe a link invariant? In other words, 
is the sum of the signs of the crossings in a link diagram preserved by 
all Reidemeister moves? 


At this point, you may be wondering how useful the writhe is. We will 
discover in Chapter 6 that the writhe, given the way it behaves under the 
Reidemeister moves, can help us to define one of the most widely used knot 
and link invariants. In addition, one restriction of the writhe provides a 
simple, but useful, virtual knot invariant in Chapter 8. For now, we will 


investigate another restriction of the writhe for links that yields a fundamental 
link invariant. 


5.2. The Linking Number 


We now have a well-defined notion of the sign of a crossing in a link 
diagram, but how can we harness this sign convention to develop link 
invariants? The answer is quite simple. For a two-component oriented link, 
we define a link invariant called the linking number. 


Definition 5.2.1. Let J and K be two components of an oriented link diagram. 
The linking number of the link formed by J and K (ignoring any other 
components the link may have), denoted Ik (K, J), is half the sum of the signs 
of the crossings where J and K cross. 


Exercise 5.2.2. Find at least one pair of link diagrams from Table 1.1 that do 
not have the same linking number. 


Exercise 5.2.3. Show that the oriented Whitehead link, shown in Figure 
5.2.1, has linking number 0. 


Figure 5.2.1: The oriented Whitehead link. 


Exercise 5.2.4. Construct three examples of link diagrams with three 
different nonzero linking numbers. 


Next we prove that the linking number is indeed a link invariant. 


Theorem 5.2.5. If D and D’ are two diagrams of a two-component link L, 
then the linking numbers of D and D’ are equal. 


Exercise 5.2.6. Prove Theorem 5.2.5. (Hint: Show that the linking number is 
unchanged by the application of R1, R2, and R3 moves.) 


Exercise 5.2.7. Consider the torus link T> ,, where n is a positive even 
number. In this case, T> ,, is a two-component link. What is the linking 
number of T, ,, if both components are oriented in the same direction? What if 
the two components are oppositely oriented? 


Exercise 5.2.8. Find a family of two-component links (other than the T, ,, 


example) such that, for every integer n, there is exactly one member of the 
family with linking number n. 


The definition of linking number makes it clear that lk (J, K) and lk (K, J) are 
equal. However, it isn’t clear why the linking number is always an integer 
(rather than sometimes equal to half an integer). We will prove that the 
linking number is indeed always an integer through the following sequence of 
lemmas. 


First, note that the sum in the definition of the linking number above can be 
split into the sum of the signs of the crossings where the K component 
crosses over the J component, and the sum of the signs of the crossings where 
the J component crosses over the K component. Let’s call these subsums the 
K over J sum and the J over K sum and denote them by 2,; and Xx, 


respectively. Using our sum notation, we have the following. 


Ik( J, K) 


) sign(c) 
c, a crossing of J with K 
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Our first lemma states that the two ‘oversums’ defined above are unchanged 
by Reidemeister moves. In other words, both X,,; and Xx,; are invariants in 


their own rights. 


Lemma 5.2.9. For a link with two components K and J, 
(i) the K over J sum is unchanged by Reidemeister moves, and 
(ii) the J over K sum is unchanged by Reidemeister moves. 


Exercise 5.2.10. Prove part (i) of Lemma 5.2.9. Part (ii) follows by the same 
argument. 


Next, we investigate how changing a crossing between two components of an 
oriented link diagram impacts the difference between the two oversums. 


Lemma 5.2.11. Let K and J be the components of a link diagram D. Suppose 
one crossing of component J with component K is changed, resulting in a 
new diagram D'. Then the difference Xx ,; — Xj does not change value, as D 


is changed to D'. 


The following lemma looks at a special case. Suppose all the crossings of K 
with J are crossings of K over J. What would the link look like in this case? 
Are the two components of the link actually linked? 


Lemma 5.2.12. If the crossings in an oriented link diagram are such that K 
always passes over J, then the difference Xx, — Xx,7 is Zero. 


Exercise 5.2.13. Prove Lemma 5.2.12. (Hint: Notice that this link can be 
deformed so that K and J have disjoint projections.) 


Theorem 5.2.14. The linking number is always an integer. 


Exercise 5.2.15. Prove Theorem 5.2.14. (Hint: Put the previous two lemmas 
together to make conclusions about an arbitrary link with two components. 
Start with a diagram D of an arbitrary link of two components J and K. 
Consider the diagram D’ identical to D except that all crossings of K with J 
are crossings of K over J. By Lemma 5.2.12, what does the difference X47 — 


Xz equal? Now change D' back to D, one crossing at a time and use Lemma 
52-11.) 


Notice that the argument in Exercise 5.2.15 also proves that the linking 
number can be defined via either of two oversums, i.e., 


GK = =>. 


J/K K/J 


5.3. Tricolorability 


We just explored an example of a quantity that fails to be an oriented knot or 
link invariant as well as a quantity that is an invariant of oriented links with 
two components. We don’t yet have an example, however, of an invariant 
that is defined for knots. So let’s explore the idea of a knot coloring. The 
colorability of a diagram is a new genre of invariant that can be explored both 
for knots and for links. 


Definition 5.3.1. A knot diagram is called tricolorable if each arc in the 
diagram can be drawn using one of three colors, say red (R), yellow (Y), and 
blue (B), in such a way that the following two conditions hold. 

1) At least two colors are used in the diagram. 

2) At each crossing, either all arcs are colored the same or all arcs are 
different colors. 


Exercise 5.3.2. (a) Finish assigning colors to the black arcs of the diagram in 
Figure 5.3.1 to give a valid tricoloring of the diagram. (b) Determine which 


diagrams in Table 1.1 are tricolorable. 


Figure 5.3.1: An example of a partial tricoloring of a knot diagram. 
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Exercise 5.3.3. Which of the diagrams in Figure 5.1.2 are tricolorable? 
As mentioned above, tricolorability is a link invariant. We prove this by 
showing it is impossible for one diagram of a knot K to be tricolorable while 


another diagram of the same knot is not tricolorable. 


Theorem 5.3.4. If a given diagram of a knot, K, is tricolorable, then every 
diagram of K is tricolorable. 


Theorem 5.3.4 ensures that the following definition makes sense and gives 
our first example of a knot invariant. 


Definition 5.3.5. A knot is called tricolorable if its diagrams are tricolorable. 


Exercise 5.3.6. Prove Theorem 5.3.4. (Hint: Show that the tricolorability of a 
diagram is unchanged by the application of R1, R2, and R3 moves.) 


Exercise 5.3.7. Prove that the trefoil knot is not equivalent to the unknot. 


Exercise 5.3.8. For which values of n is the torus knot T, ,, tricolorable? 


Exercise 5.3.9. Explore the tricolorability of pretzel links. Make a conjecture 
and prove it. 


Exercise 5.3.10. Are either of the closed braids in Figure 3.4.5 tricolorable? 


5.4 A Generalization of Tricolorability 


In Section 5.3, we saw how tricolorability is used to distinguish between 
knots. In this section, we will generalize this idea to a new invariant that uses 
more than three colors. Instead of adding more colors such as magenta, 
chartreuse, burnt sienna, and the like, we think of numbers as being ‘colors.’ 
That is, we label the strands in our diagram with numbers from the set L,, = 


{0, 1, 2,...,n-—1}. For tricoloring, in particular, we use numbers in the set 
{0, 1, 2} instead of our red, yellow, and blue palette. 


To define generalized n-colorability we mimic the first condition in the 
definition of tricolorability that at least two labels from L,, are used. In 


generalizing the second half of the definition, however, we have more 
freedom. One way to interpret the condition that ‘at each crossing, either all 
arcs are colored the same or all arcs are different colors’ is to say that once 
two of the three strands in a crossing have been labeled, say with labels a and 
b, then the label of the third strand should be determined by some formula 
depending on a and b. The simplest type of formula would be a linear 
formula, such as c = Xa + Yb, where c is the label of the third strand and X 
and Y are coefficients selected to ensure that n-colorability is a well-defined 
invariant. 


Figure 5.4.1: At a given crossing, the strands are colored with numbers a, b, c 
from L,. 
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The next exercise investigates the values of X and Y in the linear relationship 
c=Xat+ Yb. 


Exercise 5.4.1. Consider the labels a, b, c of the strands involved in a 
crossing as shown in Figure 5.4.1. 


1. Notice that the color a is on the overstrand, while b and c are on the 
understrands. If a and b are known, the linear formula c = Xa + Yb 
should give the value of c. However, by symmetry, if a and c are 
known, the formula b = Xa + Yc should give the value of b. Use these 
two equations to find an integer value of Y. 


2. Now, plug the integer that you found for Y above into the equation c = 
Xa + Yb so that we can determine what integer X ought to be. Note that 
the resulting linear equation needs to be satisfied if a = b = c since 
monochromatic colorings of crossings are required for R1 invariance of 
colorability. Use this constraint to determine what X must be. 


Another constraint on the linear formula c = Xa + Yb is that c should be one 
of the numbers from L,, = {0, 1, 2,...,n-—1}. This next exercise investigates 


what can be done to address this issue. 


Exercise 5.4.2. Return to the previous definition of tricolorability, but use the 
‘colors’ 0, 1, and 2 instead of red, yellow, and blue. Consider the six 
crossings in Figure 5.4.2. 


1. For each crossing in Figure 5.4.2, the definition of tricoloring requires 
the value of c to be the color that is not used on the other two strands. 
Determine the value of c that is forced by this constraint for each 
crossing. 


2. In each of the six crossings, use the linear relationship c = Xa + Yb with 
the values of X and Y found in Exercise 5.4.1 to determine the integer 
that c must equal. 


3. Compare the results of the previous two tasks. How can c be viewed so 
that the value of c found using our original tricoloring definition 
coincides with the requirements forced by the linear formula? (Hint: 
Modular arithmetic is helpful here.) 


Exercises 5.4.1 and 5.4.2 lead to one possible generalization of coloring with 


more than three colors, as is given in the following definition. 


Definition 5.4.3. Let p be an odd prime. A knot diagram is p-colorable if 
each arc of the diagram can be labeled with an integer from 0 to p — 1 such 
that: 

1) At least two labels are distinct. 

2) At each crossing, the relation 2 x — y— z = 0 (mod p) holds, where x is the 
label on the overstrand and y and z are the other two labels. 


Figure 5.4.2: A figure of six crossings labeled from the set L3 = {0, 1, 2}. In 


each crossing, two of the three strands have been colored and one strand, 
labeled c, is to be determined. 


Exercise 5.4.4. Figure 5.4.3 illustrates a partial 7-coloring of a knot. Check 
that each crossing has a valid labeling and determine the values of a and b. 


Figure 5.4.3: Complete this partial 7-coloring. 


The following is a generalization of Theorem 5.3.4. 


Theorem 5.4.5. If a given diagram of a knot K is p-colorable, then every 
diagram of K is p-colorable. Thus, p-colorability is a knot invariant. 


To prove Theorem 5.4.5, let D be a p-colorable diagram of a knot K. Recall 
that any two diagrams of K are related via a sequence of Reidemeister moves 
and planar isotopies. Since planar isotopies change only the shape of an arc 
and do not change the crossings, it suffices to show that after the application 
of a Reidemeister move, the resulting diagram of K remains p-colorable. 


We begin by applying an R2 move to the diagram D and show that the 
resulting diagram D' is also p-colorable. There are two possibilities we need 
to consider: (1) D' contains the two additional crossings created by the R2 
move; or (2) D contains the two crossings that are removed by the R2 move. 
We consider case (1) first. 


Suppose two arcs in a p-coloring of D are colored x and y as indicated on the 
left of Figure 5.4.4. After the R2 move, the diagram D’ inherits the valid 
coloring of D on all arcs not depicted in Figure 5.4.4. This means that the 
arcs emanating into the rest of the diagram for D' must remain colored x and 
y to ensure the coloring remains valid elsewhere in the diagram. As D was 
colored using more than two labels, D' also uses more than two labels. 
Therefore, a valid coloring exists for D’ if and only if there exists a numerical 
color for the arc c that results in a valid coloring on the two new crossings 
created by the R2 move. 


Exercise 5.4.6. Determine the numerical color of arc c in Figure 5.4.4 so that 
the two crossings adjacent to arc c have valid p-colorings. (Hint: The 
numerical color for c will depend on x and y.) 


Figure 5.4.4: A p-coloring before and after an R2 move. 


Exercise 5.4.7. Finish the proof of Theorem 5.4.5. That is, complete the 
proof for the R2 move, case (2), and complete the proofs for the application 
of R3 and R1 moves. 


Exercise 5.4.8. Determine which knots with six or fewer crossings can be 5- 
colored. For each example, exhibit a 5-coloring. (Figure 2.3.5 contains 
diagrams of all knots with six or fewer crossings.) 


Exercise 5.4.9. For which primes p can the trefoil knot diagram be p- 
colored? (Hint: Label the three strands x, y, and z, and then write out the 
system of equations that must be satisfied by a valid coloring. For which 
primes p can you solve this system?) 


Exercise 5.4.10. The p-coloring invariant can be used to distinguish between 
knots. The knot 8, is both 5- and 7-colorable. However the knots 7, and 4, 


are not. Prove this. Diagrams of the knots 8,¢, 7,;, and 4, are given in Figure 
5.4.5. 


Figure 5.4.5: The knots 8,¢, 7;, and 4,. 


5.9 Matrices, Colorings & Determinants 


In this section, we will use linear algebra to simplify the problem of finding a 
p-coloring of a diagram or proving that no such coloring exists. We will also 
find a new knot invariant along the way. 


To translate the problem of coloring a knot diagram into a linear algebra 
problem, we begin by denoting the color of each arc by a variable. If the knot 
diagram has n arcs, we color them Xj, X9, X3, .. . , X;- For each crossing in the 
diagram, we can write down a linear equation relating these variables as 
follows. If arc x; passes over a crossing where x; and x, are the arcs that come 


together to form the understrand of the crossing, then we write 


24; -—2%;-—2,=0 (mod p). 


A p-coloring of the knot exists if there is a solution, z, to this system of linear 
equations such that the entries of # are not all equal. 


Now the problem of finding a p-coloring has been reduced to solving a 
system of linear equations. As is typical in linear algebra, we will write the 
system of linear equations as a matrix equation. Let’s work through an 
example with the square knot pictured in Figure 5.5.1. Using the given 
labeling, the corresponding system of mod p equations is listed in Figure 
Bhe- 


Figure 5.5.1: The diagram of a square knot that contains six arcs. 
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Figure 5.5.2: The system of equations from the square knot with labels from 


Figure 5.5.1. 
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Exercise 5.5.1. Determine the missing equation in Figure 5.5.2. 


This system of equations is written in matrix form in Figure 5.5.3. 


Notice that there is a one-to-one relationship between the arcs and the 
crossings in a knot diagram. One way to see why this is true is to orient the 
knot and observe that each arc in the diagram emanates from a unique 
crossing. Therefore, in general, a diagram with n crossings will result in an n 
x n matrix. We call this matrix the crossing-arc matrix associated to the 
knot diagram D, denoted by Ap. Notice that each row of the crossing-arc 
matrix is associated to one of the crossings. Hence, every row contains the 
entries —1, —1, and 2 and all remaining entries are zero (except in the case 
where the crossing is an R1 twist, in which case the nonzero entries are —1 


and 1). On the other hand, each column of the crossing-arc matrix is 
associated to one arc of the diagram. Hence, for a given arc, the 
corresponding column contains two —1 entries for the two crossings involving 
the arc’s endpoints. In addition, for each time the arc passes over a crossing, 
the column contains a 2. The remaining entries of the column are 0 (except in 
the case when the arc ends in an R1 twist, in which case the column includes 
an entry of 1). 


Figure 5.5.3: The matrix equation from the system of equations in Figure 
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Exercise 5.5.2. Fill in the bottom row of the crossing-arc matrix in Figure 
5.5.3. Determine the values of a, b, c, d, e, and f. 


Standard techniques of linear algebra apply to solving systems of equations 
mod p. (Formally, for a prime number p, the integers mod p form a field.) 
Unfortunately, the added condition that at least two of the x;’s differ 


introduces a few subtleties that need to be addressed before general results 
can be presented. 


We begin with a few observations. First, note that setting each x; = lisa 


solution to the system of equations in Figure 5.5.3. This is true in general 
because the matrix always consists of rows whose nonzero entries (namely, 2, 
—1, and —-1) sum to zero. (Note, however, that a solution of all 1’s is nota 
solution to the coloring problem because we need at least two distinct entries 
in the vector of colors.) Second, recall that for any matrix equation Ax = 0, if 
w and # are two solutions and k is any constant, then w+ @ and kw are also 
solutions. 


These two observations imply that if there is a solution w = (w), we,...,w,) with 
not all entries equal, then there is a solution 


U = (v1, v2,...,Un) with uv, = 0, ie., 


where j denotes an n x 1 vector with all entries equal to 1. 


Exercise 5.5.3. Prove that if uw = (w;,we,...,w,) is a solution to 4# — ( with not 
all entries equal, then = w + (—w,)1 is another solution such that v, = 0 and 


not all entries are equal. 


Thus, if for a knot K there is a valid p-coloring, then there is also valid 
coloring with v,, = 0. Observe what this implies about the crossing-arc matrix. 


A solution with not all entries equal corresponds to a nonzero solution to the 
system of equations determined by the original matrix with its last column 
deleted. Looking back at the matrix in Figure 5.5.3, this discussion implies 
that to find a p-coloring of the square knot, we want to find a nonzero 
solution of the matrix equation in Figure 5.5.4. 


Figure 5.5.4: The crossing-arc matrix equation from Figure 5.5.3 with the 
sixth column and variable x, both removed. 


Recall, from linear algebra, that six vectors in 325 must be linearly dependent. 
Hence, the six row vectors in the matrix from Figure 5.5.4 must be linearly 
dependent. Therefore, one of the six equations contributing to the matrix 
equation in Figure 5.5.4 is unnecessary, so it is possible to reduce the system 
down to five equations and five unknowns. In general, the same argument 
can be applied to reduce the n x (n — 1) matrix equation associated to a knot 
with n crossings to a square (n — 1) x (n— 1) matrix equation. Which equation 
can we eliminate? In other words, which row of the crossing-arc matrix can 
we remove? 


Let the rows of the square knot’s crossing-arc matrix be denoted by 

F1s72s--+s rg. If, for example, we can show that the sixth row is a linear 
combination of rows 1 through 5, then we can delete the bottom row of the 
crossing-arc matrix. We will show that this is indeed the case by creating a 
specific linear combination of the rows, using only coefficients c; = +1, such 


that the linear combination below, sums to the zero vector. 
C17] + Coro +--+ cerg = 0 


This will prove linear dependence of the rows. As row 6 will have either a 1 
or —1 for a coefficient, this proves it is dependent on the other five rows and 
thus the sixth row can be eliminated. 


Before determining the coefficients c;, we must define a checkerboard 
coloring of a link diagram. 


Definition 5.5.4. Given a link diagram, D, a checkerboard coloring of D, is 
a coloring of the regions of the diagram using two colors, for example the 
colors green and white, such that the green regions are adjacent only to white 
regions, and the white regions are adjacent only to green regions. 
(Sometimes, we forgo the use of specific color names and use ‘shaded’ and 
‘unshaded’ to refer to the two colors for the regions.) 


Every link diagram can be checkerboard colored in two ways. For examples, 
see Figure 5.5.5. 


Figure 5.5.5: Two checkerboard colorings of a link. 


To determine which coefficients should be multiplied by —1 in our linear 
combination, we apply the following clever algorithm. First, orient the knot. 


At each crossing in the diagram put a dot to the right of the overstrand just 
before entering the crossing. Now, checkerboard color the diagram of the 
knot so that the color of the unbounded region of the plane is white or 
unshaded. If the dot for a crossing lies in a shaded region, give the row of the 
matrix corresponding to that crossing a coefficient of —1, or else use a 
coefficient of 1. We call this algorithm the dot-checkerboard algorithm. 


Figure 5.5.6 shows the square knot with a choice of orientation, the 
checkerboard coloring of the diagram, and the dot associated to each 
crossing. 


Figure 5.5.6: A dotted, checkerboard colored diagram of the square knot. 
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Exercise 5.5.5. Prove that every knot diagram can be checkerboard colored. 
(Hint: Can the unknot be checkerboard colored? Can every diagram of the 
unknot be checkerboard colored? Does changing a crossing in a diagram 
impact the validity of a checkerboard coloring? Also, consider using Exercise 
1.2.4.) 


Exercise 5.5.6. Prove that the choice of orientation does not influence the 
outcome of the dot-checkerboard algorithm. 


As indicated by Figure 5.5.6, the rows associated with the three crossings on 
the right-hand side of the diagram, that is, rows 4, 5, and 6, have coefficients 


of —1. The resulting linear combination now sums to the zero vector. 


Exercise 5.5.7. Suppose F\,..., rg are the row vectors from the crossing-arc 
matrix associated to the square knot discussed above. Check the arithmetic in 
the following equality. 


aoe 


ry +ro+73 -—174—-7s — TE = ( 


This proves rg is linearly dependent on rows 1 through 5, which implies the 
bottom row of the crossing-arc matrix can be eliminated without changing the 
solution space of the matrix equation. Removing the bottom row, we now 
have a5 x 5 altered crossing-arc matrix, denoted Ap: as shown in the matrix 


equation in Figure 5.5.7. 


Figure 5.5.7: The altered crossing-arc matrix equation. A nontrivial solution 
of this equation corresponds to a valid p-coloring of the knot diagram. 
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We proceed by applying methods from linear algebra to the associated square 
matrix to study whether or not a knot diagram has a p-coloring. Recall, if A is 
a square matrix, then the equation 4; — ¢ has a nontrivial solution if and only 
if det(A) = 0, or, since we are working mod p, if and only if the determinant is 
divisible by p. The 5 x 5 matrix above has determinant 9. Since 9 is divisible 
by 3, the knot in Figure 5.5.1 can be 3-colored. Since 9 has no other prime 
divisors, this knot cannot be p-colored for any prime p other than 3. 


The following theorem summarizes the discussion above. 


Theorem 5.5.8. Let Ap denote a crossing-arc matrix associated to a knot 
diagram, D, with n arcs. Deleting any one column and any one row of Ap 
yields a new matrix Ap. The knot diagram, D, can be p-colored if and only if 
the matrix equation Apz = 0 has a nontrivial solution mod p. 


Exercise 5.5.9. In the example of the square knot, the last column and the last 
row of the crossing-arc matrix A were deleted. However, Theorem 5.5.8 
states that any one row and any one column of A can be deleted. Explain why 
it does not matter which row and which column are deleted. (Hint: For the 
proof that any column can be deleted, study the solution to Exercise 5.5.3. 
For the proof that any row can be deleted, notice the dot-checkerboard 
algorithm gives a linear combination of all rows in which every row has a 
nonzero coefficient.) 


Recall that we made a choice when assigning the labels x,, X5,..., x, to the 


arcs in our diagram. Changing this assignment of labels will result in a 
different crossing-arc matrix. It is not difficult to see, however, that the 
process above can be applied and will yield the same result. One way to see 
this is to return to viewing the crossing-arc matrix equation as a system of 
linear equations. Notice that the existence of a solution to the system of 
equations has no bearing on the names of the variables used within the 
system. So the assignment of the labels to the arcs in the knot diagram 
doesn’t influence the outcome. 


To complete the proof of Theorem 5.5.8, it suffices to show why the dot- 
checkerboard algorithm always gives a linear combination of the rows of the 
crossing-arc matrix that sums to the zero vector. 


Exercise 5.5.10. Given an arbitrary knot diagram, D, with n crossings, prove 
that the dot-checkerboard algorithm described above results in a linear 
combination of the row vectors of the crossing-arc matrix that sum to the zero 
vector. (Hint: Analyze an arbitrary arc x; in your knot diagram. There are 
several possibilities to consider for x;. First, assume this arc never passes over 
a crossing. (Such an arc doesn’t exist in an alternating knot diagram, but it 
may exist in an arbitrary diagram.) In this case, what are the entries of the i” 
column? In the linear combination produced by the dot-checkerboard 
algorithm, do these entries sum to zero? Next, analyze the arc x;, assuming 


that this arc passes over exactly one crossing. What are the entries of the i” 
column of the matrix? In the linear combination produced by the dot- 
checkerboard algorithm, do these entries sum to zero? Continue your analysis 
of arc x;, assuming it passes over two, three, or four crossings. Once you 


determine a pattern, explain what happens in general.) 


Now that we have a way of translating the p-coloring question into a question 
about the existence of a nontrivial solution of the matrix equation 4,,;— 6, 
let’s recall the powerful relationship between a matrix equation and the 
determinant of the matrix while reviewing some important ideas from linear 
algebra. 


Linear Algebra Review, Fact 1. A nontrivial solution to 4; — § exists if and 
only if A is not invertible, and A is not invertible if and only if det(A) = 0. 
These equivalences still hold when considered modulo a prime p. 


Perhaps surprisingly, the determinant gives us information about the 
colorability of the knot, not merely the knot diagram D. Thus, the determinant 
yeilds another powerful knot invariant. For a knot K and any diagram D of K, 
the determinant of K is the absolute value of the determinant of Ap, the 
associated (n — 1) x (n— 1) matrix obtained from the crossing-arc matrix by 
deleting any one column and any one row. 


Recall, that the determinant of a 2 x 2 matrix A | : : lis given by det (A) = 


ad — bc. For larger matrices, determinants can be calculated in several ways, 
one of which is a recursive process called minor expansion. The ij” minor of 
ann x n matrix A, denoted Mj, is the (n — 1) x (n— 1) matrix obtained from A 
by eliminating the i” row and the j column. Minor expansion can be 
performed along any row or any column of an n x n matrix A. Along the i” 


row, we fix i and use the formula 


n 


det(A) = So(-)* ay det (Mj; ). 
j=1 


To calculate a determinant along the j column instead, we fix j and use the 
formula 


nm 


det(A) _ S0(-1)Maij det (M;;). 
i=l 


Notice that this formula is recursive in the sense that one must still calculate 
the determinant of the minors, Mj. However, the minors are smaller in size 
than A so if the process is repeated, it will eventually end with the calculation 
of the determinant of one or more 2 x 2 matrices. 


Here is a quick example showing the first step in calculating the determinant 
using minor expansion along the fourth row of the matrix M below. 


IZ & OQ 1 
1 0 6 2 
det (7) = det 0 110 3 
$ f OU 


= (—1)°8 det(My,) + (—1)®7 det(My2) + (—1)'0 det(My3) + (—1)811 det(Myq) 


5 0 -l 12 0 -1l im & © 
=-S8det] 0 6 z + 7 det 1 6 2 + 11 det 1 O 6 
1 10 8 0 10 38 0 1 10 


Now there are three 3 x 3 determinants to calculate to complete the 
calculation of the determinant of M. Let’s look at the calculation of the 
determinant of the minor M,, using minor expansion along the first column. 


Zs O 
det} 1 0 6 

0 1 10 

6 0 6 5 0 5 0 
='f_9)2198de _1)31 det _1)40 de 
(—1) i2aet| | io | +! 1)s1adet | io | + 1ytoder| 5 


= 12(—6) + (—1)(50) + 0(30) = —86 


Exercise 5.5.11. Complete the calculation that we started above for the 
determinant of the 4 x 4 matrix M. 


Exercise 5.5.12. Find the determinant of each knot in Figure 2.3.5. Using the 
determinant, find the values of p for which each knot is p-colorable. 


To prove that the determinant of a knot gives a well-defined knot invariant, 


we must show it is invariant under Reidemeister moves. In preparation for 
this, we recall some important observations and theorems from linear algebra. 
These results will be useful to us as we complete several upcoming exercises. 


First, let’s look at a definition. Given an n x n matrix A, the elementary 
matrix that corresponds to a specified row or column operation on A is 
the matrix that is obtained by performing the specified row or column 
operation on the n x n identity matrix. 


Linear Algebra Review, Fact 2. Performing an elementary row 
(respectively, column) operation on a matrix A is equivalent to multiplying A 
on the left (respectively, right) by the corresponding elementary matrix. 


For instance, the row operation “add 5 times row 1 to row 2” (when applied 
to a3 x 3 matrix) corresponds to the elementary matrix 


1 0 O 
EF=/]/5 1 O 
0 0! 


This correspondence is useful because, as Fact 1 implies, the matrix product 
EB, for an arbitrary 3 x 3 matrix B, gives the result of “adding 5 times row 1 
of B to row 2 of B” in a much more succinct expression (You should check 
this product!). Also notice that if we calculate the matrix product in the 
reverse order, BE, then matrix E performs the column operation “add 5 times 
column 2 of B to column 1 of B.” This is to be expected since E can also be 
viewed as the result of this exact column operation applied to the 3 x 3 
identity matrix. So, by multiplying the elementary matrix E on the left of B, 
we perform the row operation corresponding to E, and multiplying E on the 
right of B performs the column operation corresponding to E. 


Here is an example of this observation that includes several row and column 
operations applied to a 2 x 2 matrix A. 


Consider the matrix A : ; | and apply the following row and column 


operations. 


¢ add two times row 1 of A to row 2 


* multiply row 2 by + 
¢ add the negative of row 2 to row 1 
* add —3 times column 1 to column 2 


Exercise 5.5.13. Prove that the list of row and column operations above 
0 -1 


transforms A into B = | 4 <i0 | 


Applying Linear Algebra Review, Fact 2, the 2 x 2 matrix A and the list of 
row and column operations that transform A into B are equivalent to the 
following matrix product equality. 


1 -1 1 0 1 0]f/1 4][0 1 1 3] [oO -1 
0 1 0 5]L2 1],2 0 1 Oo|{0 1] ° [4 -10 
Exercise 5.5.14. (1) Calculate the matrix product 


1 -1)f/1 0][1 0][1 4][0 1 =. 0 -1 ; 
F aie | Alle “Ale ali |to show | § ip | that it 


is equal to 


(2) Comment on the order of the elementary matrices in this product. Why is 


the matrix . the first in the product? Why is | : | last in this 


product? 


Next, we recall a few more facts from linear algebra. You might also want to 
review these results by reading any standard linear algebra textbook. 


Linear Algebra Review, Fact 3. The determinant is multiplicative. That is, 
for k x k matrices A and B, we have the equality det(AB) = det(A) det(B). 


Linear Algebra Review, Fact 4. 


¢ If E is an elementary matrix corresponding to adding a multiple of a 
row/column to another row/column, then det (E) = 1. 


¢ If Eis an elementary matrix corresponding to swapping two 
rows/columns, then det(E) = +1. 


¢ If E is an elementary matrix corresponding to multiplying a 
row/column by the scalar k, then det(E) — k. 


This completes our linear algebra review. Returning to our main goal of this 
section, we can show that the determinant of a knot remains constant after the 
application of each of the three Reidemeister moves. To begin the proof of 
invariance under an R1 move, consider the following notation. Let D be a 
diagram with n crossings and, hence, n arcs. Select an assignment of the arc 
labels x1, Xo, .. . , X,, aS pictured in Figure 5.5.8, so that the R1 move is 


performed on the arc labeled x,,. Suppose, also, that x,, passes over K 


crossings on one side of the R1 move and passes over j crossings on the other 
side of the R1 move (K and/or j may equal 0). After the R1 move, we keep 
the arc labels constant with one exception: the new arc will be labeled x,,,;. 


Figure 5.5.8: Notation for the crossing-arc matrix of a knot before and after 
an R1 move. 
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Using the notation in Figure 5.5.8, we have the two crossing-arc matrices 
shown in Figure 5.5.9. 


Figure 5.5.9: The crossing-arc matrices associated with the two diagrams in 
Figure 5.5.8, with A* denoting the n x (n — 1) matrix containing the crossing- 
arc values from afcs X;,..., Xn_1- 
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Exercise 5.5.15. Prove that the determinant of a knot is unchanged by the 

application of an R1 move. (Hint: Use row and column operations and the 
linear algebra review facts to prove that both matrices in Figure 5.5.9 have 
the same determinant as the matrix in Figure 5.5.10.) 


Figure 5.5.10: Both matrices in Figure 5.5.9 have determinants equal to the 
determinant of the n x n matrix below, up to a factor of +1. 
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Exercise 5.5.16. Prove that the determinant of a knot is unchanged by the 
application of an R2 move. (Hint: Set up your notation with care, as we did in 
Exercise 5.5.15.) 


Exercise 5.5.17. Prove that the determinant of a knot is unchanged by the 
application of an R3 move. 


Chapter 6 


Knot Polynomials 


6.1 The Alexander Polynomial 


The Alexander polynomial, a knot invariant discovered by James W. 
Alexander in 1923, was the first and only polynomial knot invariant for over 
60 years. The polynomial can be calculated by evaluating the determinant of 
a matrix associated to a diagram of a knot. In addition to picking a diagram of 
the knot to compute the Alexander polynomial, many choices are made when 
the matrix is constructed. It is a rather amazing polynomial, but there is much 
to prove when showing that the construction that produces the polynomial is 
an invariant of knots. 


Let K be a knot with an n crossing knot diagram D. Choose an orientation for 
D. Label the arcs and the crossings of the diagram each with the integers 1 
through n. Create an n X n matrix with entries in row w determined by the arc 
labels that meet at crossing w. If, while following the orientation along arc i, 
we encounter arc j to the left and k to the right of crossing w, then the matrix 
entries of row w are 1 —¢ in column i, t in column j and —1 in column k. All 
other entries in row w are zero. If any of i, j, or k happen to be equal, then we 
put the sum of the entries into the appropriate column. 


Notice that this matrix is similar to the matrix we used when calculating the 
determinant of a knot. However, instead of the row w containing the entries 
2, —-1, —1, it contains the linear functions 1 — t, t, and —1. Observe that when 
this matrix is evaluated at t = — 1, we obtain the crossing-arc matrix used in 
calculating the determinant of the knot. Also notice that, just as with the 
matrix that is used to compute the determinant of a knot, the column vectors 
of this matrix sum to the zero vector. 


Figure 6.1.1: The Alexander polynomial matrix entries for crossing w. 
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Definition 6.1.1. Given an oriented and labeled diagram D of a knot K, create 
the matrix Mp whose entries are described in Figure 6.1.1. The (n — 1) x (n—- 
1) matrix obtained by removing the bottom row and last column from Mp is 
called an Alexander matrix of the diagram D. The determinant of an 
Alexander matrix, denoted by A(t), is called an Alexander polynomial of 
the diagram D. The determinant of a 0 x 0 matrix is defined to be 1, and 
thus, the Alexander polynomial of the unknot is defined to be 1. 


The following examples show that the Alexander polynomial as described is 
not quite a knot invariant. It depends on the knot diagram, the orientation, 
and the choices made in the knot labeling. However, we will discover that an 
Alexander polynomial can be normalized to produce a bonafide knot 
invariant! 


Exercise 6.1.2. To calculate the Alexander polynomial of the figure eight 
knot diagram shown in Figure 6.1.2, we begin by selecting an orientation, as 
well as a labeling of the arcs and crossings. One possibility is to make the 
choices shown in Figure 6.1.2. For this diagram, we label the crossings by the 
label of the overstrand arc. 


1. Find the 4 x 4 matrix associated with these choices. 
2. Find the Alexander matrix associated with these choices. 
3. Find the resulting Alexander polynomial of the diagram in Figure 6.1.2. 


Figure 6.1.2: Oriented and labeled figure eight knot. 1 


Exercise 6.1.3. Next, consider the figure eight knot shown in Figure 6.1.3, 
with crossing labels shown. Label each arc according to the crossing that the 
arc passes over. 


1. Find the 5 x 5 matrix associated with these choices. 
2. Find the Alexander matrix associated with these choices. 
3. Find the resulting Alexander polynomial of the diagram in Figure 6.1.3. 


Figure 6.1.3: Labeled figure eight knot with R1 move performed. 


CY 


Exercise 6.1.4. (1) Calculate the Alexander polynomial of the left-handed 
trefoil knot from the knot diagram that has three negative crossings. 

(2) Calculate the Alexander polynomial of the right-handed trefoil knot from 
the knot diagram that has three positive crossings. 

(3) Perform an R2 move on the right-handed trefoil, and then calculate the 
Alexander polynomial for this new diagram. 


As we Saw in the previous exercises, the Alexander polynomial depends on 
the choice of the knot diagram and the choices made when labeling the 
diagram. However, we also observe that the polynomials obtained by making 
various choices differ by no more than a factor of +t* for some integer k. 


Theorem 6.1.5. Suppose D is an oriented knot diagram and consider two 
different labelings of the arcs and crossings of D. Denote the two labeled 
diagrams by D, and D,. The Alexander polynomials of the two labelings 


satisfy Ap(t) = +tXAp,(0 for some integer k. 


Exercise 6.1.6. Prove Theorem 6.1.5. (Hint: A change in labeling of exactly 
two arcs of D corresponds to swapping the two corresponding columns of the 


Alexander matrix. Recall that swapping two columns of a matrix is an 
elementary operation. Find the analogous correspondence for a change in the 
labels for exactly two crossings. When the corresponding elementary 
operations are applied to the Alexander matrix, what is the effect on the 
Alexander polynomial?) 


Theorem 6.1.7. Suppose D and D’ are two different diagrams of an oriented 
knot K. Then the Alexander polynomials of the two diagrams satisfy Ap(t) = 


+tkAp(t) for some integer k. 


Together, we will prove Theorem 6.1.7 in the case where D and D’ differ by 
an R3 move. The remaining cases of Theorem 6.1.7 are left as Exercise 
6.1.10. 


Suppose the oriented diagrams D and D' differ by an R3 move, as shown in 
Figure 6.1.4. Label the arcs for D and D’ identically throughout the diagram 
except in the local region where the R3 move is made. For these local arcs, 
using the labeling scheme shown in Figure 6.1.4. Observe that the arcs that 
extend beyond the local regions are consistently labeled before and after the 
R3 move. Next, we create the two matrices Mp and M7, corresponding to the 
labeled diagrams before and after the R 3 move, as shown in Figures 6.1.5 
and 6.1.6. The matrix B* is identical in both Mp and M7, due to the choice of 


identical labeling for the portions of diagram D and D' that are not pictured. 


Figure 6.1.4: An oriented R3 move with labeled arcs, a,,..., ag and labeled 
crossings C,, C5, C3. 


Exercise 6.1.8. Use Figure 6.1.4 to check the matrix entries in the top three 
rows of Mp and M,,, as shown in Figures 6.1.5 and 6.1.6. Explain why 
column 1, in both matrices, contains only zeros below the third row. Explain 
why rows 1 — 3, for both matrices, contain only zeros in the entries after the 
sixth column. 


Our goal now is to use elementary row and column operations to transform 
Mp into a form that is similar to M7). Initially, we do not want the elementary 


row and column operations to alter B*, so we restrict to using row operations 
on the top three rows, and column operations that add a multiple of column 1 
to some other column. 


Figure 6.1.5: The matrix Mp associated to diagram in D from Figure 6.1.4. 


The submatrix B* contains the matrix entries for the remaining crossings in 
the diagram. 


ay a2 a3 a4 5 ag other arcs 
Cy 0 1-t -1 : & 0 0...0 
Co —1 1-t 0 0 ¢ O © smal 
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Figure 6.1.6: The matrix Mj associated to diagram in D’ from Figure 6.1.4. 


The submatrix B* contains the matrix entries for the remaining crossings in 
the diagram and is identical to B* in Figure 6.1.5. 


a} ay az a4 an a6 other arcs 


Ci 0 1-é -l t 0 0 | ee | 
Co t 1-t QO 0 0 —1 GF sasanaihey 
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Exercise 6.1.9. (a) Apply the following list of row and column operations to 
the matrix Mp from Figure 6.1.5. 


(1) Add (1 — t) times row 1 to row 3. 
(2) Add t times column 1 to column 5. 
(3) Add t times column 1 to column 6. 
(4) Multiply row 2 by ct. 

(5) Multiply row 3 by ct. 

(6) Multiply column 1 by -t. 


Let the matrix resulting from these five operations be denoted by Vp. 


(b) Observe that 7, is now identical to Mj, except in two of the matrix 
entries. 


(c) In the next manipulation, we alter i, and M,, in an identical way so that 
any changes to B* are the same in both matrices. 


(7) Swap the second and last columns of 7. Name the result /},. 
(8) Swap the second and last columns of M,,. Name the result M',,. 


(d) For each matrix manipulation (1) — (8), state the effect it has on the 
determinant of the matrix. 


(e) The Alexander polynomial is obtained by deleting the last column and 
bottom row of \//, and M', and taking the determinant of the resulting 


matrices. Prove that the Alexander polynomials of D and D' differ by no 
more than a factor of +t“. 


Exercise 6.1.10. Prove the remaining cases of Theorem 6.1.7. (Hint: You 
must prove the result holds if D and D' differ by a single Reidemeister move. 
Use Figure 2.3.6 to prove the result holds for a generating set of oriented 
Reidemeister moves. For each Reidemeister move, take special care to set up 
your notation and the matrices Mj and M,,. Find elementary operations that 
can be applied to these two matrices that show they lead to Alexander 
polynomials that differ by no more than a factor of +t*.) 


Next, we investigate the influence of the choice of orientation on the 
calculation of the Alexander polynomial. We will see that the coefficients of 
the Alexander polynomial exhibit a special symmetry. 


Exercise 6.1.11. Let D denote the diagram of the figure eight knot with labels 
and orientation as in Figure 6.1.2. Find the Alexander polynomial of the 
reverse, D. Show that Aj(t~') = +t* Ap(t) for some integer k. 


Theorem 6.1.12. Suppose D is an oriented and labeled diagram and its 
reverse is >. The Alexander polynomial of the diagram and its reverse satisfy 
Ap(t~*) = + Ap(t) for some integer j. 


Exercise 6.1.13. Prove Theorem 6.1.12. 


We can now define the Alexander polynomial of a knot as a normalized 
version of the polynomial that we have been calling the Alexander 
polynomial of a diagram D. 


Definition 6.1.14. Given the Alexander polynomial, Ap(t), for a labeled and 
oriented diagram, D, of a knot, K, factor the polynomial as 


+t*Ax(t) = Ap(t), 


for some integer k, so that A,(t) has a positive constant term. The resulting 
polynomial, A;(t), is the Alexander polynomial of the knot K. 


The polynomial A;(t) is independent of the choice of labeling by Theorem 


6.1.5, is independent of the choice of diagram by Theorem 6.1.7, and is 
independent of the choice of orientation by Theorem 6.1.12. Therefore, the 
Alexander polynomial, A,;(t), is an invariant of unoriented knots. 


Exercise 6.1.15. Calculate the Alexander polynomial of the seven crossing 
knot diagram in Figure 6.1.7. 


Figure 6.1.7: A knot diagram with seven crossings. 
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The Alexander polynomial can be used to distinguish many knots from one 
another, but there are some knots it cannot differentiate. 


Exercise 6.1.16. Show that the knot in Figure 6.1.8 has Alexander 
polynomial 1. 


The knot in Figure 6.1.8 is one of two knots with 11 crossings that has a 


trivial Alexander polynomial. No knots with 10 or fewer crossings has a 
trivial Alexander polynomial, other than the unknot. 


Figure 6.1.8: A nontrivial knot with trivial Alexander polynomial. 


6.2 The Kauffman Bracket & Jones Polynomial 


Another polynomial, discovered in 1984 by Vaughn F. R. Jones, associates a 
Laurent polynomial! with integer coefficients [16] to every knot or link. 
Unlike the Alexander polynomial, the Jones polynomial can sometimes 
distinguish between nonequivalent mirror images of a knot. Indeed, we will 
see that the Jones polynomial can distinguish between the left-handed trefoil 
and right-handed trefoil, that is, the trefoil that can be drawn with three 
negative crossings and the trefoil that can be drawn with three positive 
crossings. The Jones polynomial is a stronger invariant than the Alexander 
polynomial. For example, currently there is no known example of a nontrivial 
knot K whose Jones polynomial is trivial, whereas the knot in Figure 6.1.8 is 
such a knot for the Alexander polynomial. 


We will define the Jones polynomial of a link L through a recursively-defined 
function called the Kauffman bracket [20]. The Kauffman bracket is a 
function, (Z), that assigns a Laurent polynomial to an unoriented link diagram 
L. To compute the Kauffman bracket of a link diagram, we provide a way of 
relating the bracket of a nontrivial link diagram to a pair of brackets of 
simpler link diagrams. Specifically, the Kauffman bracket of a link diagram, 
L,,, with n crossings, for n > 1, is defined to be equal to the value of a certain 


sum, 
A(Li 1) + A (Lp_1), 


involving the Kauffman bracket of two associated link diagrams, each with n 
— 1 crossings. The two simpler link diagrams, 1;*_, and £,,-;, are obtained 
from the original diagram L,, by smoothing at a crossing of L,, in both 
possible ways. In other words, a crossing is replaced by a [0] or [~] tangle. 
This is called Rule 3. 


We observe that when smoothing a crossing in a link, a disconnected, trivial 
unknot diagram may arise. If this happens, we can simplify the bracket of the 
disjoint union L’ U U where U denotes a crossingless diagram of the unknot. 
The bracket (L’ UU’) is replaced by a multiple of the Kauffman bracket of L’, 
namely (— A? — A~?)(L"). Voila! We have Rule 2. 


Finally, we define (U’), the bracket of the unknot diagram with no crossings, to 
be 1. This base case is called Rule 1. 


These three rules are summarized in the following definition. 


Definition 6.2.1. The Kauffman bracket of a link diagram, L, is a 
polynomial in integer powers of the variable A, denoted by (), defined by the 
following three rules. 


1. VA = 

2. our y= 4? ONY 
Fe 4 4 \ + A l 

3. ae _ 
(IX) =47710 O+ AC 


Notice that for any link, we can repeatedly apply Rule 3, reducing the number 
of crossings at each step. We can apply Rule 2 to eliminate disjoint unknotted 
components. Ultimately, we will be left with a single unknot component to 
which we can apply Rule 1. In Exercise 6.2.4, we show that the order in 
which these rules are applied does not affect the final result. Thus, the 
Kauffman bracket is a well-defined quantity assigned to a link diagram. 


First, let’s see these rules in action on a very simple pair of examples. 


Example 6.2.2. We compute the following bracket polynomials: ( (4C) ) and 
(OO). 


(OO )=400 O )+4-%O) (Rule 3) 


= A(—A* — A~?)(C)) + A (CO) (Rule 2) 
=A(-A* —-A“)\ 4.4 (Rule 1) 
_ — As 

(OO )=4%O O)+4©O) (Rule 3) 
= A-'(-A* — A~*)(C)) + A(C) (Rule 2) 
= AY AP A) A (Rule 1) 
— —4-3 


Now, let’s all try computing the Kauffman bracket! 


Exercise 6.2.3. Show that the bracket polynomial of the left-handed trefoil 
shown in Figure 6.2.1 is A’ — A? — A~°. You may use the results of Example 


6.2.2 to simplify your calculation, if you like. 


Figure 6.2.1: The left-handed trefoil. 


Exercise 6.2.4. (a) Show that when Rule 3 for the bracket polynomial is 
applied to two crossings of a diagram, the order in which it is applied does 
not affect the result. 

(b) Show that when Rules 2 and 3 for the bracket polynomial are applied to a 
link with a disjoint union of the unknot, the order in which they are applied 
does not affect the result. 


Exercise 6.2.5. Find the effect on the Kauffman bracket of performing an R2 
move. In other words, what should X and Y be in Figure 6.2.2? 


Figure 6.2.2: The Kauffman bracket of an R2 move. 
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In Exercise 6.2.5, you proved that the Kauffman bracket is invariant under 
the R2 move. Indeed, this invariance is how Kauffman determined the terms 
A and (-A* — A’) in the definition of the Kauffman bracket. Let’s take a 
look at how the invariance of the Kauffman bracket under R3 moves is a 
direct result of its invariance under R2 moves as shown in Figure 6.2.3. 


Figure 6.2.3: The Kauffman bracket of an R3 move. 
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In order to determine whether or not the Kauffman bracket is a link invariant, 
it remains to consider the R1 moves. As we saw in Example 6.2.2, there 
seems to be a problem with this type of Reidemeister moves. Indeed, the 
Kauffman bracket is not invariant under R1 moves. 


Figure 6.2.4: The Kauffman bracket of an R1 move. 
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Exercise 6.2.6. Determine the coefficients U and V of the bracket equations 
in Figure 6.2.4. 


Can you think of another quantity we've seen that is invariant under R2 and 
R3 moves, but not R1 moves? That’s right! The writhe! (Recall Definition 
5.1.1.) Because both the writhe and the Kauffman bracket fail to be link 
invariants in similar ways, the writhe of a link is used to adjust for the lack of 


invariance of the bracket under an R1 move. Using this clever trick, we can 
create a link invariant. 


Based on our findings in Exercise 6.2.6, we see that multiplying the bracket 
polynomial (L) of an oriented link L by (-A)?”“) where w (L) is the writhe of 
L, will cancel out the noninvariant behavior of the bracket of an R1 move. 
Using Exercises 5.1.7 and 6.2.5 as well as the computation in Figure 6.2.3, 
we can prove that the polynomial, K;(A), called the Kauffman bracket 


polynomial or the Kauffman polynomial, 
K,(A) = (—A)~”) (L) 


is a link invariant. Notice that for a knot, the invariant does not depend on the 
choice of orientation, by our result in Exercise 5.1.3. However, for a link of 
two or more components, the writhe will depend on the choice of orientation 
of the components. (See Exercise 5.1.4.) This is precisely why we now 
require L to be oriented. 


Exercise 6.2.7. Prove that the Kauffman bracket polynomial is an invariant 
of oriented links. 


At last, we are in a position to define the Jones polynomial. 


Definition 6.2.8. The Jones polynomial of a link L, denoted by V,(0), is 


obtained by making the variable substitution A = 4 


K,(A). 


in the polynomial 


Exercise 6.2.9. Using Exercise 6.2.3, determine the Jones polynomial of the 
left-handed trefoil knot. 


Exercise 6.2.10. Calculate the Jones polynomial of the right-handed trefoil 
knot. Use Exercise 6.2.9 to conclude that the trefoil knot is not equivalent to 
its mirror image. 


Exercise 6.2.11. Choose orientations for the components of the Whitehead 
link (see Figure 5.2.1) and compute the Jones polynomial. Compute the Jones 
polynomial of the unlink with two components. Use these results to conclude 
the Whitehead link is a nontrivial link. (To do your Kauffman polynomial 


computation, you may prefer to use the diagram of the Whitehead link with 
five crossings, shown in Table 1.1, example (i).) 


6.3 Tait’s Conjecture 


The discovery of the Jones polynomial led to an explosion of new ideas and 
results in knot theory. One such result is a proof of one of the renowned Tait 
Conjectures, which were first proposed in the late 19th century by physicist 
Peter Guthrie Tait [38]. We alluded to one of the Tait Conjectures in Section 
1.3, Exercise 1.3.6. This exercise asks you to play with alternating knots with 
the goal of conjecturing their crossing number. Several diagrams in Figure 
1.3.1 have one or more crossings that can be untwisted, thus reducing the 
number of crossings in the diagram. The crossings that can be untwisted are 
called reducible crossings. A reducible crossing can be readily identified in a 
diagram by observing that the four regions adjacent to a reducible crossing 
are not distinct regions in the diagram, as in Figure 6.3.1. 


Definition 6.3.1. A reducible crossing is a crossing in a knot diagram such 
that a single region in the diagram is twice adjacent to that crossing. A 
diagram without any reducible crossings is called a reduced diagram. 


Figure 6.3.1: An example of a diagram with a reducible crossing. The shaded 
region below is twice adjacent to the reducible crossing c. 
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Exercise 6.3.2. Consider the diagrams in Figure 1.3.1. Find all of the 
crossings in these diagrams that are reducible. Are any of these diagrams 
reduced? 


The idea of the first Tait Conjecture is that once an alternating diagram is 
reduced, the number of crossings in the diagram cannot be decreased. 


Theorem 6.3.3. [Tait’s Conjecture] If L is a connected and reduced 
alternating diagram of a link with n crossings, then the crossing number of L 
is n. Equivalently, any two connected, reduced alternating diagrams of a link 
L have the same number of crossings. 


As you can see, Tait’s Conjecture specifically applies to connected link 
diagrams. Let’s look at a definition. 


Definition 6.3.4. A link diagram L is called connected provided that the 
corresponding link shadow is a connected subspace of the plane. 


We will prove Tait’s Conjecture using the Kauffman polynomial. However, 
rather than using the recursive definition of the bracket given in Definition 
6.2.1, we introduce an explicit formula for the Kauffman bracket in terms of 
smoothed states of a diagram. 


Recall the recursive formula for the Kauffman polynomial that reduces 
evaluating the Kauffman bracket on a diagram with n crossings to evaluating 
it on a diagram with n — 1 crossings. 


(A) = AQ) () + AC) 


We call the smoothing that is paired with the coefficient A an A-smoothing 
and the smoothing that is paired with the coefficient A‘ an A~!-smoothing of 
the crossing. For a link diagram L, the diagram obtained by performing a 
smoothing of type A or type A! for every crossing of L is called a smoothed 
state of L. 


For a link diagram L and state S of L that contains i A-smoothings and j A-!- 
smoothings, we define the bracket of L for the state S as 


(L|S) = A*(A7?). 


Figure 6.3.2: The figure eight knot—commonly referred to by its name, 4,, 
from the Rolfsen knot table [33]—together with two of its 16 smoothed 
states. Each crossing in the smoothed state is labeled with an A or A“!, 


according to its smoothing type. 
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For example, using the notation from Figure 6.3.2, we have the following. 


(4:|S1) = AAS = A? 
(41|S2) = A* 


Given a link L with n crossings, the explicit formula for the Kauffman 
bracket polynomial, (1), is obtained by considering all possible states, S, of L. 
Since each crossing can be smoothed in two distinct ways, L will have 2” 
associated states. We take the sum over all states S of L of the product of (|) 
with a factor that accounts for the number of connected components in that 
state. 


Theorem 6.3.5. Given a link L with n crossings, the Kauffman bracket of L is 
given by 


(L) = S “(L|S)(—A? — A-?)IS|-1 


S 


where |S| denotes the number of components of the smoothed state S of L. 


In our Figure 6.3.2 example, the summands of (4;) corresponding to the states 
S, and S> are: 


(4.\S)(=4"* = 42-4 = AA = AO A“ 
(4;|So)(—A? — A-?)!S2I-1 — 44(—A? — A-?)? = AP 4+2A441 


Exercise 6.3.6. Show that the Kauffman bracket of the figure eight knot 
diagram in Figure 6.3.2 is 


AP Me 44 AP 4 Are 


by completing the computation started above and using Theorem 6.3.5. (Hint: 
There are 14 more states to consider.) 


Exercise 6.3.7. Calculate the Kauffman polynomial of the left-handed trefoil 
knot using the explicit formula for the Kauffman polynomial in Theorem 
6.3.5. Compare your answer with the one you obtained in Exercise 6.2.3. 


Exercise 6.3.8. Using the intuition you gained from completing Exercises 
6.3.6 and 6.3.7, prove Theorem 6.3.5. (Hint: Proceed by induction on the 
number of crossings in a knot diagram.) 


Exercise 6.3.9. Conjecture and prove a relationship between the Kauffman 
bracket polynomial of a link L and its mirror image L™. (Hint: First, look at 
the relationship between the Kauffman bracket of the left- and right-handed 
trefoils for inspiration. Next, test your conjecture. The knot 4i is equivalent to 
its mirror image. Is your conjecture consistent with this fact?) 


Now that we are armed with an explicit formula for the Kauffman bracket, 
we can work toward a proof of our main result. Our proof of Tait’s 
Conjecture makes use of the Kauffman bracket via its span, defined as 
follows. 


Definition 6.3.10. For a link diagram L, the Kauffman span of L, span(L), is 
the maximum degree of the Kauffman bracket of L, (Z), minus the minimum 
degree of (L). 


Exercise 6.3.11. (a) What is span(4,)? (b) How does the span of the left- 


handed trefoil compare to the span of the right-handed trefoil? 


Interestingly, while the Kaufman bracket fails to be a link invariant, its span 
is a link invariant. 


Theorem 6.3.12. The Kauffman span of a link L, span (L), is a link invariant. 


Exercise 6.3.13. Prove Theorem 6.3.12. (Hint: Make use of the results of 
Section 6.2.) 


Our strategy for proving Tait’s Conjecture will involve determining the span 
of a connected alternating link diagram. 


Exercise 6.3.14. Let c be a crossing of connected link diagram L. Suppose 
the link diagram L is checkerboard colored such that, at the crossing c, the 
shaded regions adjacent to c are the regions that would be joined by an A- 
smoothing at c. Show that if L is alternating, then at every other crossing in 
the diagram, the shaded regions are the regions that would be joined by an A- 
smoothing of the crossing. (See Figure 6.3.3 for examples of such 
checkerboard colorings.) 


Figure 6.3.3: The left-handed and right-handed trefoils with A-smoothing 
checkerboard colorings. 


Our next goal is to prove the following lemma about the maximum and 
minimum degrees of the variable A in the Kauffman bracket. 


Lemma 6.3.15. Suppose L is a connected, reduced, alternating link diagram. 


Suppose L is checkerboard colored so that at each crossing of L the shaded 
regions are precisely the regions joined by an A-smoothing of the crossing. 
Then the terms of highest and lowest degree in (L) are given by 


max degree term = (—1)¥-1 gor(L)+2W—2 


min degree term = el alm aa ae alec cd 


where cr(L) is the number of crossings in L, W is the number of unshaded 
regions in the diagram, and B is the number of shaded regions in the 
diagram. 


Exercise 6.3.16. Checkerboard color the knot 4, and use your calculation 
from Exercise 6.3.6 to verify that Lemma 6.3.15 holds for this example. 


We will work toward a proof of Lemma 6.3.15 by completing the following 
exercises. Assume that the hypotheses of Lemma 6.3.15 hold for 
Exercises 6.3.17, 6.3.18, 6.3.19, and 6.3.20. 


Exercise 6.3.17. Let S be the state of all A-smoothings of the crossings of L. 
Calculate the contribution of S to the Kauffman bracket (). Show that the 
max degree term of this contribution is 


eee 


To prove Lemma 6.3.15, it suffices to show that this max degree term from 
state S (with all A-smoothings) does not cancel with a term coming from the 
contribution of some other state of L. 


Exercise 6.3.18. Let S be the state of all A-smoothings and let S’ be some 
other state. Observe that S’ can be obtained from S by changing one 
smoothing at a time from an A-smoothing to an A~!-smoothing. Let S = So, 
Sj, So... , 5, = S' be a sequence of states of L that results in S’ such that each 
state has exactly one more A“! smoothing than the previous listed state. Prove 
that the maximal degree contribution of S,,, is less than or equal to the 
maximal degree contribution of S;. 


Exercise 6.3.19. Use the fact that the diagram L is reduced to prove that the 


maximal degree contribution of S, is strictly less than that of So. 


Exercise 6.3.20. The previous exercises imply that the maximum degree term 
of (L) is 


(—] _ Arh) +2W —2 


Follow steps similar to Exercises 6.3.17, 6.3.18, and 6.3.19 to prove that the 
minimum degree term of (L) is 


| at: ial ee 


Exercise 6.3.21. Suppose that L is a connected, reduced, alternating link 
diagram. Use the previous results to determine the span of L. (Note that the 
span will be in terms of the number of crossings, cr(L), in L and the numbers 
of shaded and unshaded regions, B and W, in a certain checkerboard coloring 
of L.) 


Before we complete the proof of Tait’s Conjecture, we need to take a brief 
and helpful detour to learn about a topological invariant called the Euler 
characteristic, discovered around 1750 by the great mathematician Leonhard 
Euler. 


Definition 6.3.22. Suppose X is a topological space composed of V vertices, 
E edges, and F ‘faces’ or regions. Then the Euler characteristic of X, 
denoted y(X), is defined by the formula 


V(X) =V-E+F. 


To see how the Euler characteristic is computed, let us look at one of the 
simplest topological spaces: a sphere. It is not hard to show that the Euler 
characteristic of the sphere is equal to 2. Indeed, the sphere can be 
decomposed into one vertex (say, somewhere on the equator), an edge that 
travels along the equator and is bounded on both ends by the single vertex, 
and two ‘faces’ or regions consisting of the northern and southern 
hemispheres, as in Figure 6.3.23. In this case, V= 1, E = 1, and F = 2, and so 
y=1-1+2=2. 


Figure 6.3.23. A sphere composed of one vertex, one edge, and two faces. 


What makes the Euler characteristic so useful is that it is a topological 
invariant. In other words, no matter how we decide to view a topological 
space X as a collection of vertices, edges, and faces, x(X) remains unchanged. 
In particular, there are any number of other ways we could have chosen to 
divide the sphere into vertices, edges, and faces, but regardless of which 
choice we make, y = 2. 


You might wonder, how does this relate to our current situation? Suppose we 
draw a connected link diagram L on the surface of a sphere. If we forget L’s 
crossing information, we can view the diagram as a graph, where the 
crossings in L are vertices and the arcs between adjacent crossings are edges. 
The regions bounded by the edges of the graph form faces on the surface in 
which L lies. The outer region of a link diagram in the plane forms a strange 
sort of face. Notice that, if instead of viewing our graph as lying in the plane, 
we view it as lying on a sphere, this outer face is far more like the other faces 
in the diagram. 


Figure 6.3.24. A sphere decomposed into vertices, edges, and faces by a 
connected link shadow. 


Exercise 6.3.25. Verify that the Euler characteristic of the sphere S is 2, using 
the decomposition of S into vertices, edges, and faces created by the knot 
shadow pictured on the sphere in Figure 6.3.24. 


Next, in order to harness the Euler characteristic to relate the number of 
crossings to the number of regions in a link diagram, we need to determine 
the relationship between the number of crossings and the number of arcs ina 
link diagram. In other words, what is the relationship between the number of 
vertices and edges in the graph created by a link shadow? 


Exercise 6.3.26. View the shadow of a connected link diagram L as a graph 
on a sphere by forgetting the crossing information. Conjecture and prove a 
relationship between the number of vertices and the number of edges in the 
graph in terms of the number of crossings of L. (Hint: Draw three different 
link shadows, each with 5—10 precrossings. For each example shadow, count 
the number of vertices and edges in the graph and compare with the number 
of crossings in your link diagram.) 


Now we are convinced that the shadow of a connected link diagram L gives 
rise to a connected planar graph on the sphere such that V = cr(L), E = 2cr(L), 
and F= B+ W. Thus, 


2= x(S) 
=V-E+F 
= cr(L) — 2cr(L)+(B+W). 


It follows that 
B+W =2+cr(L). 


At last, we can use this relationship between crossings and regions to prove 
the main result. 


Exercise 6.3.27 (Proof of Tait’s Conjecture). Prove that any two connected, 
reduced, alternating link diagrams for a given alternating link have the same 
number of crossings. (Hint: Let L and L’ be two connected, reduced, and 
alternating diagrams of the same link. Prove that cr(L) = cr(L') by observing 
that span(L) = span(L’), and then simplifying by using Exercise 6.3.21 
together with the result we just derived.) 


Notes 


‘Laurent polynomials are finite polynomial expressions that can include negative powers as well as 
positive powers of variables. 


Chapter 7 


Unknotting Operations & Invariants 


7.1 Unknotting Operations 


One interesting measure of complexity in knot theory concerns how difficult 
it is to unknot a nontrivial knot if certain unknotting operations are allowed. 
In this section, we explore several methods of unknotting. We will then be 
able to consider some questions related to knot complexity. 


As we Saw in Section 1.4, the most fundamental unknotting operation is that 
of a crossing change, pictured in Figure 7.1.1 below. 


Figure 7.1.1: The crossing change move. 
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Exercise 7.1.1. Suppose the knot diagrams in Figure 7.1.2 need to be 
unknotted. Perform a number of crossing changes on these diagrams to 
produce diagrams of the unknot. In each case, how many crossing changes 
did you need? (Could you have used fewer?) 


Figure 7.1.2: Knot diagrams to be unknotted. 
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Exercise 7.1.2. Using what you discovered in Section 1.4, prove the 
following. Given any knot diagram, a number of crossing changes can be 
performed on the diagram to produce a diagram of the unknot. 


Exercise 7.1.3. Prove that no more than half the crossings in a knot diagram 
must be changed in order to produce a diagram of the unknot. (Hint: The 
mirror image of a diagram of an unknot is also a diagram of the unknot. 
Prove this and illustrate how this fact relates to the exercise.) 


While the crossing change move is the standard unknotting operation, there 
are several more local moves that can be used to produce the unknot. It may 
not be obvious to see why, but the #-move and the a-move, defined in Figure 
7.1.3, are two more examples of unknotting operations in the following sense. 
Given any knot diagram D, there is a sequence of Reidemeister moves and #- 
moves (or 4-moves) that can be performed on D to produce an unknotted 
circle. 


Figure 7.1.3: The unknotting #-move and a-move. 
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These unknotting operations (and others) are studied in [2, 26, 27]. 


Another operation that can be applied to knot diagrams is called the region 
crossing change, or RCC move. Given a diagram of a knot, an RCC move is 
performed by choosing a region R in the diagram and changing all crossings 
that fall along the boundary of that region. An illustration of how this move 
works is shown in Figure 7.1.4. We can also allow the RCC move to be 
performed on the region surrounding the knot. 


Observe that the #-move shown in Figure 7.1.3 is just a special type of RCC 
move. 


Exercise 7.1.4. Consider the knot diagrams in Figure 7.1.2. For each 
diagram, perform an RCC move on the region outside of the diagram. What 
does the resulting diagram look like? In each case, how many crossings were 
affected by the RCC move? 


Exercise 7.1.5. Consider the knot diagrams (a1) and (b1) on the left in Figure 
7.1.4. Perform RCC moves on some number of regions in each diagram to 
produce diagrams of the unknot. What is the fewest number of moves needed 
to unknot each diagram? 


Figure 7.1.4: The RCC move performed on a region R in two different knot 
diagrams. 
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Exercise 7.1.6. Determine the effect of performing an RCC move twice on 

the same region in a knot diagram. Formulate and prove a conjecture about 

the effect of performing the RCC move an even number of times on a given 
region of a knot diagram. 


Exercise 7.1.7. Consider the two knot diagrams in Figure 7.1.2. For each of 
the two diagrams, find a set of regions such that performing RCC moves on 
those regions produces an unknot diagram. 


Exercise 7.1.8. The knot diagram (a) on the left in Figure 7.1.2 is 
nonalternating. Find a set of regions on which you can perform RCC moves 
to produce an alternating knot diagram. 


In Exercise 7.1.2, you showed that the crossing change move is an unknotting 
operation. Perhaps this is an unsurprising fact. It is much more surprising, 
however, that the RCC move can be used to unknot any knot diagram, not 
just those diagrams you unknotted in Exercises 7.1.5 and 7.1.7. A young 
Japanese mathematician named Ayaka Shimizu was the first to prove our 
next main theorem, Theorem 7.1.9, in [37]. Using her strategy, we will prove 
that the RCC move is an unknotting operation. 


Theorem 7.1.9. Given any knot diagram, it is possible to obtain a diagram of 
the unknot by performing some number of region crossing change moves on 
the diagram. 


Our global strategy to prove Theorem 7.1.9 will be to piggyback on the result 
that the crossing change (CC) move is known to be an unknotting operation, 
as we showed in Exercise 7.1.2. So it suffices to prove that, for an arbitrary 
crossing c in a knot diagram D, there exists a collection of regions in D on 
which RCC moves can be performed to change c and only c. This will prove 
that the RCC move is also an unknotting operation since each crossing 
change that is required to unknot the diagram can be achieved by performing 
RCC moves on some collection of regions in the diagram. Keeping this in 
mind, let’s focus our efforts to prove the following proposition. Theorem 
7.1.9 will follow as a corollary. 


Proposition 7.1.10. Given any knot diagram D and a crossing c in D, there 
exists a collection of regions in D on which RCC moves can be performed in 
order to change the crossing c and only crossing c in D. 


Our goal is to prove Proposition 7.1.10 in two cases, first for the special case 
of reduced knot diagrams, as in Definition 6.3.1. Then, we will construct an 
argument by induction on the number of reducible crossings to prove the 
proposition in general. To see why these two cases may need separate 
treatment, we make the following observations. 


Figure 7.1.5: Regions A, B, C, and D adjacent to a crossing. 


Exercise 7.1.11. Consider region A in Figure 7.1.5. 


1. Assuming this crossing appears in an arbitrary knot diagram, determine 
which of the remaining regions (if any) could actually be the same 
region as A. (Hint: It may help to think about this question in terms of 
the crossings in a reducible diagram, e.g., diagrams (b1) and (b2) in 
Figure 7.1.4.) 


2. Assuming this crossing appears in a reduced knot diagram, determine 
which of the remaining regions (if any) could be the same region as A. 


3. Summarize your observations. 


Exercise 7.1.12. Suppose the crossing in Figure 7.1.5 appears in a reduced 
knot diagram. Determine the effect on the crossing of performing RCC 
moves on: 


(i) an even number of the regions A, B, C, and D; 
(ii) an odd number of the regions A, B, C, and D. 
Explain why the assumption that our knot diagram is reduced is important. 
In Definition 5.5.4, we introduced the idea of a checkerboard coloring of a 


knot or link diagram. As it happens, checkerboard colorings play an 
important role in figuring out which RCC moves should be performed to 


change exactly one crossing in a diagram. To continue setting the stage to 
prove Proposition 7.1.10 in the case where the diagram D is reduced, let’s 
work through the following exercises. 


Exercise 7.1.13. Consider the knot diagram (al) in Figure 7.1.4, ignoring the 
shading in the region labeled R. Draw this diagram together with a 
checkerboard coloring. Determine what happens when RCC moves are 
performed on all of the shaded regions. 


Exercise 7.1.14. Suppose, more generally, that you are given a reduced knot 
diagram D and its checkerboard coloring. Formulate and prove a conjecture 
that describes the effect of performing RCC moves on all of the shaded 
regions in the checkerboard coloring of D. 


With these observations in hand, let’s begin to prove Proposition 7.1.10 for 
the reduced diagram case. The following three-step algorithm can be used to 
determine the regions on which RCC moves will be applied to achieve our 
desired crossing change in a reduced diagram. 


Step 1: Assign an orientation to the reduced knot diagram D and smooth 
the diagram at crossing c with respect to the orientation, as pictured in 
Figure 7.1.6. 


Exercise 7.1.15. Explain why the choice of orientation in Step 1 does not 
influence the resulting smoothed diagram. 


Exercise 7.1.16. Explain why the diagram that results from Step 1 is a two 
component link. How do the two arc segments that come from the smoothing 
contribute to the two components? 


After Step 1, the new diagram will contain a newly formed region which 
consists of two regions from the original knot diagram. Call this new region 
R, as pictured in Figure 7.1.6. 


Figure 7.1.6: Smoothing at a positive or negative crossing in the direction of 
the orientation. 
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Step 2: After smoothing at crossing c, select one of the two components 
in the link diagram. For the selected component, completely ignoring the 
other, checkerboard color it so that the region R is unshaded. 


To clarify Step 2, Figure 7.1.7 illustrates two examples where one component 
of a link is checkerboard colored while the other component is ignored. 


Step 3: Using the shaded regions from Step 2, shade the corresponding 
regions in our original knot diagram and perform RCC moves on 
precisely these regions (exactly once per region). 


Exercise 7.1.17. Apply the three-step algorithm as follows to the reduced 
diagram in Figure 7.1.8 to see how the algorithm works. 


Figure 7.1.7: Checkerboard coloring a single link component in a diagram 
with two components. 


Figure 7.1.8: A knot to be simplified via RCC moves. 
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1. Apply Step 1 to the diagram in Figure 7.1.8, smoothing at the crossing 
labeled c. Label the newly formed region R. 


2. Choose one component of this diagram to checkerboard color and 
shade regions following Step 2. Note that one component choice will 
result in a rather trivial checkerboard coloring (where essentially all 
that needs to be done is to shade the interior of a disk) while the other 
component choice leads to a more interesting checkerboard coloring. 
Either choice will work. 


3. Follow Step 3 to identify the regions in the original knot diagram we 
can apply RCC moves to in order to change crossing c and only 
crossing c. Verify that performing the RCC move on these regions 
gives us our desired result. 


Now, let’s back up and see what would have happened if we had made the 
other component choice in Step 2. 


Exercise 7.1.18. You made a choice in Exercise 7.1.17 about which link 
component to checkerboard color. Now, go back to your smoothed diagram 
and select the other component. Follow Steps 2 and 3 to find an alternate 
collection of RCC moves that will switch crossing c and only crossing c. 
Verify that your collection of regions changes crossing c and only crossing c. 


Exercise 7.1.19. Go through steps 1—3 once again to identify a set of regions 
on which RCC moves can be applied in Figure 7.1.8 in order to change 
crossing c’ and only c’. 


Now, we have two sets of regions: one set of regions on which RCC moves 
can be performed to change c, and another set of regions on which RCC 
moves can be performed to change c’. What if we want to change both 
crossings? In the next exercise, we'll explore how to harness the previous 
results to perform more than one crossing change at a time. 


Exercise 7.1.20. Identify a set Q of regions of the diagram D in Figure 7.1.8 
for which the following holds. If an RCC move is applied exactly once to 
each region in Q, both crossings c and c’ will change, and all other crossings 
will stay the same. (Hint: Use your result from Exercise 7.1.12.) 


The examples above give good evidence that the three-step algorithm indeed 
determines a set of regions in a reduced diagram for which the application of 
RCC moves will successfully change a specified crossing. Now we must 
prove that this process works in general. 


Lemma 7.1.21 (Proposition 7.1.10: The Reduced Diagram Case). Given a 
reduced diagram, D, and a crossing c in D, there exists a collection of 
regions in D on which RCC moves can be performed in order to change the 
crossing c while leaving all other crossings in D unchanged. 


Exercise 7.1.22. Prove Lemma 7.1.21. In other words, show that the three- 
step algorithm always works. (Hint: There are two natural steps in this proof. 
(1) Prove that if RCC moves are performed on the identified collection of 
regions, then crossing c changes. (2) Prove that, for a crossing d # c in the 
diagram, d remains unchanged after the RCC moves have been performed. 
Be sure you know why your proof only works for reduced knot diagrams! ) 


Now that we have shown Proposition 7.1.10 holds for reduced knot diagrams, 
let’s consider diagrams with reducible crossings. We will work toward 
understanding which regions to select in an arbitrary reducible diagram by 
first considering the case of a knot diagram with exactly one reducible 
crossing. A careful analysis of this case will form the foundation for an 
inductive argument on the number of reducible crossings in a knot diagram. 
Our next goal, then, will be to prove the following lemma. 


Lemma 7.1.23 (Proposition 7.1.10: The Case of Exactly One Reducible 
Crossing). Given a diagram, D, with exactly one reducible crossing d, and a 


specified crossing c (possibly equal to d) in D, there exists a collection of 
regions in D on which RCC moves can be performed in order to change the 
crossing c while leaving all other crossings in D unchanged. 


To prove Lemma 7.1.23, we consider two cases. In Case 1, the crossing d is 
precisely the crossing we wish to change. In Case 2, we would like to change 
a reduced crossing, c # d, of D. Let’s explore an example to determine how to 
attack the simpler case, Case 1. 


Exercise 7.1.24. The diagram in Figure 7.1.9 has one reducible crossing d. 
Determine how a portion of the diagram can be checkerboard colored such 
that, if an RCC move is performed on each shaded region, the reducible 
crossing d and only d is changed. 


Figure 7.1.9: A reducible diagram with exactly one reducible crossing, d. 


Using this example for intuition, prove Case 1 of Lemma 7.1.23. 


Exercise 7.1.25. (Proof of Case 1 of Lemma 7.1.23) Describe, in general, 
how to identify a collection of regions in a diagram, D, with exactly one 
reducible crossing, d, such that using RCC moves on these regions changes 
the crossing d and only d. Prove that the regions described achieve the 
desired crossing change. (Hint: You may wish to use the notation or 
smoothing idea described in Figure 7.1.10.) 


Next, we turn to Case 2 of Lemma 7.1.23 where we want to change a 
crossing c that is a reduced crossing in a reducible diagram D. For this case, 
the following notation will come in handy. By definition, the reducible 
crossing d has an associated region that is twice adjacent to d; call this 
associated region A. If the diagram D is smoothed at the reducible crossing d 
with respect to a choice of orientation (as in Step 1 of the three-step 
algorithm), then the smoothed diagram is a disjoint union of two diagrams 
with the region A separating them. Of these two diagrams, we call the 
component that contains c diagram D,. We’|l call the other reduced diagram 
D,. Furthermore, in the diagram D, let B denote the region adjacent to 


crossing d such that after smoothing at d, B can be viewed as a region inside 
D, See Figure 7.1.10 for an illustration of these naming conventions. 


Figure 7.1.10: A diagram with a reducible crossing, d. Before smoothing, the 
regions A and B are adjacent to the crossing d, and region A is twice adjacent 
to d. After smoothing at d, D, is the diagram that does not contain crossing c. 
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Notice that in Case 2 of Lemma 7.1.23, D contains exactly one reducible 
crossing, d. If we smooth at d, there are no more reducible crossings in the 
resulting link diagram. In other words, both of the diagrams D, and D, are 
reduced knot diagrams. By Lemma 7.1.21, the three-step algorithm can be 
used on D, (while ignoring D,) to determine a collection of regions of D, that 
change c and only c. A reasonable guess is that this collection of regions, if 
changed in the original diagram D, will give us our desired result. However, 
there are some subtleties in how these selected regions behave when viewed 


as regions of D. Let’s look at an example to give us ideas for a general proof 
of Case 2. 


Figure 7.1.11: The diagram from Figure 7.1.9 with the reducible crossing d 


smoothed and the knot diagram D, ignored. In D5, the crossing c has been 


smoothed and one component of the resulting link is checkerboard colored 
(so that R is unshaded) while the other link component is ignored. Notice that 
the region A, of D, is the entire large region at the center of the diagram 


containing D,, since D, is ignored. 


Exercise 7.1.26. Return to the example in Figure 7.1.9. Smooth the reducible 
crossing d, respecting the orientation of the knot. Follow the three-step 
algorithm on D, (completely ignoring D,) and determine the selected regions 


of D, that will change c and only c in D5. Regardless of the choice of 


component you made in the three-step algorithm, the region A should be 
shaded. One such component choice is shown in Figure 7.1.11. Draw the 
checkerboard coloring corresponding to the other choice. 


We now take the regions of D, that were selected in Exercise 7.1.26 and 


adjust them to find regions in the original diagram D on which we should 
perform RCC moves. You might ask, why do the regions need adjusting? To 
answer this question, let’s view all of the shaded regions in Figure 7.1.11 as 
regions of D and pay special attention to what happens at d if RCC moves are 
applied to all of these regions. See Figure 7.1.12. 


The crossing c will indeed be changed by these regions, since RCC moves 
are applied to exactly one of the four regions adjacent to c. Unfortunately, the 
crossing d is also changed when RCC moves are applied to all of the shaded 


regions in Figure 7.1.12, since d only has three distinct adjacent regions, and 
all of them are shaded. So, in this example, a modification of the selected 
regions is required to determine an unknotting set of regions. 


Figure 7.1.12: When RCC moves are performed in the shaded regions of D, 
two crossings change in the diagram. 


It should not be surprising that the adjustments we'd like to make are in the 
portion of the diagram that has thus far been ignored, namely Dj. 


Exercise 7.1.27. Modify the shaded regions in the diagram in Figure 7.1.12 
by unshading certain regions so that, if we perform RCC moves on all of the 
remaining shaded regions, c and only c is changed. 


Exercise 7.1.28. Use the checkerboard coloring corresponding to the other 
component choice that you found in Exercise 7.1.26 to determine whether 
performing RCC moves on all of the shaded regions of your diagram (viewed 
as regions of D) changes c and only c in D. Is a modification to your 
collection of regions needed? Why or why not? What adjustment to the set of 
regions (if any) will allow us to change c and only c by performing RCC 
moves on the regions in the set? 


Using Exercises 7.1.27 and 7.1.28 for intuition, let’s go back to the general 
setting of Case 2 of Lemma 7.1.23. We will let Q denote our desired 


collection of regions of D on which RCC moves can be applied to change 
crossing c and only c. By Lemma 7.1.21, since D, is reduced, there exists a 


collection of regions of Dy, call it P, such that performing RCC moves on 
these regions achieves the desired crossing change at c and only c in Dy. 
(Once again, we are ignoring D, and its regions, as in Figure 7.1.11.) Let’s 
call A, the region of D, that contains A, as illustrated in two different types of 
examples in Figure 7.1.13. Since the regions Ay and B may or may not be in 


Pp, there are four mutually exclusive possibilities to consider in order to 
determine what Q should be. We begin by including in Q all of the regions in 
P—each now viewed as a region of D—except for A if Ap is in P (since A> is 
not a region of D). 


Figure 7.1.13: Two examples illustrating the relationship between A in D and 
Ap in Do. 


Here are the four subcases. 


1. Suppose P contains both regions A, and B, as in Figure 7.1.14. 
Checkerboard color D, (which sits inside A,) in such a way that the 


surrounding region A is included in the set of shaded regions. Add to Q 
both A and all the newly shaded regions of D, that lie within Ap. 


Figure 7.1.14: Subcase 1. 
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2. Suppose A, is in P and Bis not in P, as in Figure 7.1.15. In this case, 
add to Q all regions inside D,, as well as A. 


3. Suppose A, is not in P and B is in P. Determining the set of regions, Q, 
in this case is left as Exercise 7.1.29. 


Figure 7.1.15: Subcase 2. 
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4. Finally, suppose that A, is not in P and B is not in P, as in Figure 
7.1.16. Then neither regions inside D, nor region A should be included 


inQ. 
Figure 7.1.16: Subcase 4. 
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Exercise 7.1.29. Complete the description of regions Q in Subcase 3, 
assuming A, is not in P and B is in P. Provide a figure in support of your 


description, analogous to the figure for each case given above. 


Exercise 7.1.30. (Proof of Case 2 of Lemma 7.1.23) For each of the four 
subcases above, prove that if Q contains exactly the regions specified, then 
performing RCC moves on all regions in Q will change c in D and no other 
crossings. 


We are now more than ready to prove Proposition 7.1.10. 


Proposition 7.1.31 (Inductive Step of Proposition 7.1.10). Suppose D is a 
reducible knot diagram containing at least one reducible crossing. Let c be 
any crossing in D, perhaps a reducible crossing. Then there is a subset of 
regions Q of D on which RCC moves can be performed in order to change 
crossing c, while leaving all other crossings in D unchanged. 


Proof of Proposition 7.1.31. We will argue by induction on k, the number of 
reducible crossings in D. Lemma 7.1.21 states that Proposition 7.1.10 holds 
for a reduced diagram, that is k = 0, while Lemma 7.1.23 states that 
Proposition 7.1.10 holds for k = 1. This ‘second’ base case holds the key to 
the inductive step! At the heart of the proof of Lemma 7.1.23 (Case 2), we 
smoothed at a reducible crossing, leaving two disjoint diagrams, D, and D5, 


where we assumed that D, contains c and one fewer reducible crossing. 


Suppose Proposition 7.1.31 holds for all knot diagrams with k or fewer 
reducible crossings. Let c be a crossing in a knot diagram with k + 1 
reducible crossings. 


Exercise 7.1.32. Complete the proof of Proposition 7.1.31. (Hint: Consider 
two cases, one where c is a reducible crossing and one where c is reduced. 


First, consider the case where c is reducible, and use what you learned in 
Exercise 7.1.25. For the second case, argue that there must exist a reducible 
crossing d in D such that d # c and smoothing at d results in two disjoint 
diagrams: D, containing c, and D, which is reduced. Then apply the 
inductive hypothesis to D, and complete the proof using four cases analogous 


to Lemma 7.1.23.) 


7.2 The Unknotting Number 


In Section 1.2, the unknotting number of a knot is defined to be the 
minimum number of times the knot must pass through itself before it 
becomes unknotted. In other words, the unknotting number of a knot K is the 
minimum number of crossing change moves that must be performed on a 
diagram of K to produce the unknot, where the minimum is taken over all 
diagrams of K. We denote the unknotting number of K by u(k). 


Exercise 7.2.1. Explain why the unknotting number of a knot is a knot 
invariant. 


Exercise 7.2.2. Prove that a knot K has unknotting number 0 if and only if K 
is the unknot. 


Exercise 7.2.3. Prove that all twist knots have unknotting number one. 


Exercise 7.2.4. Show that for the (2, p) torus knot, T5 ,,, the unknotting 
number satisfies u(T),,) < 2+. (In fact, u(T,,) = 2+, but equality is more 


difficult to prove! Note for this problem that p must be odd for T, , to be a 
knot rather than a link.) 


For cases where it is less obvious how to unknot a knot diagram, we can use 
the following easily computable upper bound to determine partial information 
about the unknotting number. Suppose we have a knot K with diagram D. 
Form the Gauss diagram Gp associated to D, as described in Chapter 4. The 


trivializing number, t(D), is equal to the minimum number of arrows that 
must be removed from Gp so that no arrows intersect. 


For instance, in Figure 7.2.1, we picture a twist knot, its Gauss diagram, and 
a subdiagram of the Gauss diagram containing only nonintersecting arrows. 
In this example, t (D) = 2 since two arrows (1 and 5) had to be removed to 
produce a diagram with only nonintersecting arrows. 


Figure 7.2.1: A knot diagram D, its Gauss diagram Gp, and a subdiagram of 


Gp containing no intersecting arrows. 
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The trivializing number was originally defined by Ryo Hanaki in [13]. 
Building on Hanaki’s work, a group of undergraduate researchers working in 
the Williams College SMALL REU proved the following result [15]. 


Theorem 7.2.5. For any diagram D of a knot K, 


t(D) 
S=s 


u(K) 


Exercise 7.2.6. Use Theorem 7.2.5 to give an alternative proof that the 
unknotting number of any twist knot is 1. 


Exercise 7.2.7. Consider the knot diagrams in Figure 7.2.2. Find an upper 
bound on the unknotting number for each knot. (Hint: Find the trivializing 
number for each diagram and use Theorem 7.2.5.) 


Figure 7.2.2: Diagrams to be unknotted. 


(a) (b) (c) 


In Exercise 7.2.7, you found upper bounds for the unknotting numbers of 
several knot diagrams. In particular, you discovered indirectly that diagram 
(c) can be unknotted with three crossing changes. Surprisingly, if we make 
the diagram of this knot more complex, the knot can be unknotted with fewer 
crossing changes! 


© 


Exercise 7.2.8. Observe two diagrams of the pretzel knot Ps ; , in Figure 
7.2.3. Verify that (i) the minimal crossing diagram of P; , , can be unknotted 


by changing its three highlighted crossings; and (ii) the diagram with more 
crossings can be unknotted by changing its two highlighted crossings. 


Figure 7.2.3: Nakanishi-Bleiler example. 


The pair of diagrams in Figure 7.2.3 was discovered independently by 
Nakanishi [28] and Bleiler [6] as an example showing that the unknotting 
number of a knot might only be achievable on a diagram that has more than 


the minimum number of crossings. This is one of the factors that makes the 
unknotting number of a knot particularly difficult to compute. 


One famous result that has been used to determine certain unknotting 
numbers was proven by Martin Scharlemann [35]. Recall, from Section 1.6, 
the definitions of prime and composite knots. His classical theorem 
concerning unknotting numbers that is easy to understand, but difficult to 
prove, is the following. 


Theorem 7.2.9. Any composite knot has unknotting number at least 2. 
Equivalently, all knots with unknotting number 1 are prime. 


Exercise 7.2.10. Use Theorem 7.2.9 to answer the following questions. 


1. What can you conclude about the unknotting number of the square knot 
(pictured in Figure 5.5.1)? 


2. What does Exercise 7.2.3 imply about the family of twist knots? 


7.3 The Region Unknotting Number 


We have just explored the classical unknotting number. Similarly, we can 
define unknotting numbers for the other unknotting operations, just as Ayaka 
Shimizu first did for the RCC move in [37]. In particular, the region 
unknotting number, up(K), of a knot K is the minimum number of RCC 


moves that must be performed in a diagram of K to produce the unknot, 
where the minimum is taken over all diagrams of K. Abusing notation, we 
use Up(D) to denote the minimum number of regions that must be changed in 


a particular diagram D of a knot in order to produce a diagram of the unknot. 
Note that if D is a diagram of K, then up(K) < up(D). (Explain why!) 


Exercise 7.3.1. Prove that the following families of knots all have region 
unknotting number 1. 


1. Twist knots 


2. Knots with Conway notation [[m 2 m]] (as in Definition 4.3.40) 


3. Knots with Conway notation [[m 2 (m + 1)]] 


Exercise 7.3.2. Make conjectures about the values of up(K) for each of the 
knots K shown in Figure 7.2.2. 


Exercise 7.3.3. Make a conjecture about the region unknotting number of a 
(2,p) torus knot, up(T> ,) 


In an attempt to learn something about the region unknotting number, one 
natural question to ask is: what is the relationship between up(K) and the 


crossing number c(K) of a knot K? A related, but simpler, question to answer 
is the following: what is the relationship between the region unknotting 
number of a knot diagram and the number of regions in the diagram? Let’s 
explore these questions for the case of reduced diagrams of knots. 


The following terminology will help us in our exploration. Suppose we have 
a reduced knot diagram with a checkerboard coloring. Let B be the set of 
shaded regions in the diagram and W be the set of unshaded regions. Now 
suppose that P is some subset of B and V is some subset of W. Then, B — P 
will denote all shaded regions not in the subset P and W — V will denote all 
unshaded regions not in V. Let’s proceed by working through the following 
two exercises, using what we learned in Exercise 7.1.14 about applying RCC 
moves to all shaded regions in a reduced knot diagram. 


Exercise 7.3.4. An example of a reduced knot diagram with sets of regions B, 
P, W, and V is shown in Figure 7.3.1. 


1. Consider the effect of performing RCC moves on all regions in P. 

2. Consider the effect of performing RCC moves on all regions in B — P. 

3. Consider the effect of performing RCC moves on all regions in V. 

4. Consider the effect of performing RCC moves on all regions in W — V. 
What did you notice? 


Exercise 7.3.5. Formulate and prove a conjecture about the relationship 


between the diagram obtained by performing RCC moves on all regions in P 
and the diagram obtained by performing RCC moves on all regions in B — P. 
Then, formulate and prove a similar conjecture for V and W — V. 


The following exercise uses the floor function, [x], i.e., the function that 
takes in a real number x and returns the largest integer less than or equal to x. 


Exercise 7.3.6. Suppose that D is a reduced, checkerboard colored knot 
diagram and let b denote the number of shaded regions and w the number of 
unshaded regions in D. Let U be a diagram of the unknot with the same 
underlying projection as D. Show that U can be obtained from D using no 
more than n moves, where 


Figure 7.3.1: Subsets of checkerboard colored regions: B, the set of ‘shaded’ 
regions, consists of the regions colored dark and light green, and W, the set of 
‘unshaded’ regions, consists of regions colored dark and light orange. In 
particular, regions in the set P are colored dark green, and regions in B— P 
are light green, while regions in the set V are colored dark orange, and 
regions in W — V are light orange. 


(Hint: Use the result of Exercise 7.3.5.) 


Exercise 7.3.7. Building upon Exercise 7.3.6, prove Theorem 7.3.8. 


Theorem 7.3.8. The region unknotting number up(D) of a reduced knot 
diagram D is no greater than half the number of regions in D. 


This is a nice result in and of itself, but let’s push our investigation a bit 
further to answer our original question. How does the region unknotting 
number of a knot K relate to the crossing number of K? Since Theorem 7.3.8 
tells us about the relationship between up and the number of regions in a 


diagram, we're halfway there. 


Recall the relationship we discovered in Section 6.3 between the number of 
regions, F, in a knot diagram D and the number of crossings, c(D): 


F =2+c(D). 


Exercise 7.3.9. Prove Theorem 7.3.10. (Hint: Use the fact that F = 2 + c(D) 
together with your result from Exercise 7.3.6.) 


Theorem 7.3.10. Let D be a reduced knot diagram and c(D) the number of 
crossings in D. Then 


ur(D) < cD) +2 


Now, we have all the tools we need to show the following corollary about the 
region unknotting number of a knot, not just the region unknotting number of 
a particular diagram of the knot. 


Corollary 7.3.11. Let K be a knot. 


1. If Dis any reduced diagram of K, then up(K) < ““)**, where c(D) 
denotes the number of crossings in D. 


2. In general, up(K) < ““*, where c(K) denotes the crossing number of 
K, as defined in Activity 1.4. (Hint: Use the fact that every knot has a 
reduced diagram.) 


Exercise 7.3.12. Prove Corollary 7.3.11. 


Chapter 8 


Virtual Knots 


8.1 What is a Virtual Knot? 


Think back to Chapter 4 where we first learned about using Gauss diagrams 
to record knot diagrammatic information. Since there is a well-defined 
algorithm for creating a Gauss diagram from a knot diagram, we can see that 
every knot diagram corresponds to a Gauss diagram. You might ask, though, 
about the converse of this statement. Does every well-formed Gauss diagram 
correspond to a knot diagram? The answer is a resounding no. The 
surprisingly simple example in Figure 8.1.1 illustrates this fact. 


Figure 8.1.1: A Gauss diagram without a knot. 


Exercise 8.1.1. Consider the Gauss diagram from Figure 8.1.1. It is 
impossible to construct a corresponding knot diagram. Try to do the 


impossible and construct a knot diagram for this Gauss diagram. In the 
process, discover what goes awry. Describe your findings. 


This fact is precisely what prompted the discovery of a more general 
collection of knots called virtual knots. A virtual knot can formally be 
defined as a Gauss diagram, where two virtual knots are considered to be 
equivalent if and only if their Gauss diagrams can be related by a sequence of 
the Gauss diagrammatic Reidemeister moves that you discovered in Chapter 
4, (See Figure 4.2.5.) 


The theory of virtual knots was independently discovered by Louis 
Kauffman, in [21], and Naoko and Seiichi Kamada, in [17], where the 
knotlike objects are called abstract knots. 

Exercise 8.1.2. Show, using the Gauss diagrammatic Reidemeister moves 
discovered in Chapter 4, that the virtual knots in Figures 8.1.1 and 8.1.2 are 


equivalent. 


Figure 8.1.2: A virtual knot equivalent to the simplest nontrivial virtual knot. 
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It’s clear from the definition that all ordinary knots (i.e., the knots we've been 
studying until now, which we will henceforth refer to as “classical knots”) are 
also virtual knots since they correspond to Gauss diagrams, but there is a vast 


world of nonclassical virtual knots. The simplest example is our friend the 
virtual trefoil shown in Figures 8.1.1 and 8.1.2. Figure 8.1.3 illustrates 
several other simple examples of nonclassical virtual knots. 


Figure 8.1.3: Two distinct nonclassical virtual knots. 
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While virtual knots can be defined in terms of Gauss diagrams, they can also 
be defined in terms of strange sorts of knot diagrams. A virtual knot diagram 
is a knot diagram that may involve both ordinary crossings and virtual 
crossings. The ordinary, classical crossings are the familiar crossings that 
appear as arrows in the Gauss diagram. The virtual crossings are those we are 
forced to draw when reconstructing a knot diagram from a Gauss diagram. 
Virtual crossings can be thought of as not really existing, since they don’t 
appear in the Gauss diagram of a virtual knot. In a virtual knot diagram, 
virtual crossings are drawn as in Figure 8.1.4. 


Figure 8.1.4: The two types of crossings in a virtual knot diagram: a classical 
crossing and a virtual crossing. 


In Figure 8.1.5, a virtual knot is given both in terms of its virtual knot 
diagram and its Gauss diagram. For a simple exercise, identify each arrow in 
the Gauss diagram with its corresponding classical crossing. It is very 
important to remember that the virtual crossings do not appear in the Gauss 
diagram—only classical crossing information is recorded. 


Figure 8.1.5: A virtual knot diagram and its Gauss diagram. 
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Exercise 8.1.3. Construct virtual knot diagrams for the Gauss diagrams in 
Figure 8.1.3. 


Exercise 8.1.4. Prove that any virtual knot diagram containing only virtual 
crossings (and no classical crossings) must be the unknot. (See Figure 8.1.6 


for examples of such diagrams.) 


Exercise 8.1.5. Prove that any virtual knot diagram containing exactly one 


classical crossing must be the unknot. (Hint: The proof of this fact may be 
quite similar to the proof of Exercise 8.1.4.) 


Exercise 8.1.6. Construct virtual knot diagrams that correspond to the Gauss 
diagrams in Figures 8.1.1 and 8.1.2. Can one (or both!) of these diagrams be 
drawn with a single virtual crossing? (Hint: Why might this knot be called the 
virtual trefoil?) 


Figure 8.1.6: Examples of virtual diagrams of the unknot. 


By virtue of our original definition of virtual knot equivalence, we see that 
classical Reidemeister moves can be performed on virtual knot diagrams 
without changing the virtual knot type. What other Reidemeister-type moves 
should be included in a list of R-moves for virtual knots? 


Exercise 8.1.7. Any additional Reidemeister moves we allow for knots with 
virtual crossings should have no effect on the Gauss diagram of the knot. In 
Figure 8.1.7, there are eight potential Reidemister-type moves for virtual 
knots. 


1. Determine which subset of these moves we should call the virtual 
Reidemeister moves. Use your results to write a reference sheet of the 
three classical and four virtual Reidemeister moves that characterize 
virtual knot equivalence. (Hint: Exactly half of these moves should be 
allowed.) 


2. For each of the moves that should not be allowed on our list, provide a 
Gauss diagram schema illustrating the effect of the prohibited move. 


Exercise 8.1.8. Use the set of three classical and four virtual R-moves to 
show that the virtual knot diagrams you constructed in Exercise 8.1.6 
represent equivalent virtual knots. (Notice that this is an alternative way to 
prove the equivalence in Exercise 8.1.2.) 


Exercise 8.1.9. By providing sequences of virtual Reidemeister moves, prove 
that the diagrams in Figure 8.1.6 are all diagrams of the unknot. 


Figure 8.1.7: Eight potential virtual Reidemeister moves. 


Exercise 8.1.10. Create a virtual knot diagram with eight virtual crossings 
and six classical crossings that is equivalent to the virtual knot shown in 
Figure 8.1.5. (Be creative!) Make sure you provide the sequence of R-moves 


that demonstrates the equivalence. 


8.2. Virtual Knot Invariants 


Now that we know what virtual knots are, one big question we might ask is 
the following. How can we tell when a given virtual knot is actually 
nonclassical? In other words, given a Gauss diagram how do we know if we 
actually need virtual crossings to construct the given knot? Similarly, if 
presented with a knot diagram that has virtual crossings, how do we know 
whether it is equivalent to a diagram with only classical crossings? Before we 
discover a partial answer to this question, let’s build some intuition about 
why this task may be tricky! 


Exercise 8.2.1. The diagram in Figure 8.2.1 is a virtual-looking diagram of 
the unknot. Use the classical and virtual R-moves to prove that this is indeed 
the unknot. 


Exercise 8.2.2. Draw several different virtual knot diagrams that are all 
equivalent to the (classical) trefoil but that all have some extraneous virtual 
crossings in them. Show that your diagrams are equivalent to the trefoil by 
providing a sequence of diagrams that are related by virtual and classical R- 
moves. 


Figure 8.2.1: The unknot in a virtual disguise. 


Given a diagram with virtual crossings, it can be difficult to determine 
whether or not the diagram might represent a classical knot. To identify 


certain virtual knots that are nonclassical, we introduce an idea of 
Kauffman’s called crossing parity, first introduced in [19]. 


Definition 8.2.3. A crossing in a virtual knot diagram is called even if its 
corresponding chord in the Gauss diagram intersects an even number of 
chords. Otherwise, we call the crossing odd. 


Exercise 8.2.4. Classify all chords in the Gauss diagram in Figure 8.1.5 as 
either even or odd. Use this classification to determine which crossings in the 
corresponding knot diagram are even and which are odd. 


Exercise 8.2.5. Give an alternate definition for even and odd crossings that 
relies only on the virtual knot diagram, without making reference to the 
corresponding Gauss diagram. Is it possible to avoid any reference to virtual 
crossings in your alternate definition? 


Now consider the following quantity associated to a knot diagram D. 


Definition 8.2.6. The odd writhe, J(D), of a knot diagram D is the sum of 
the signs of all odd crossings in D. 


Notice that if D has no odd crossings, J(D) = 0. 


Exercise 8.2.7. Find the odd writhes of the Gauss diagrams in Figures 8.1.1, 
8.1.2, 8.1.3, and 8.1.5. 


Exercise 8.2.8. Give a virtual knot diagram D for which the value of J(D) is 
—5. 


Exercise 8.2.9. Suppose D is a virtual knot diagram and D” is its mirror 
image (obtained from D by changing all classical crossings of D). Show that 
J(D) =— J (D"). 


In Chapter 5, we learned that the writhe of a knot is not a knot invariant. 
Interestingly, the odd writhe is an invariant for virtual knots. 


Exercise 8.2.10. Prove that the odd writhe is an invariant of virtual knots. In 
other words, prove that J is a function such that if virtual knot diagrams D 


and D' are equivalent, then J(D) = J (D’). 


Exercise 8.2.11. Use the result you proved in Exercise 8.2.10 to find a virtual 
knot diagram D that is not equivalent to its mirror image D”. 


A particularly useful fact about the odd writhe is that J = 0 for all classical 
knots. In other words, if you are able to prove that J(K) ~ 0, this is a proof 
that K is a nonclassical virtual knot! 


Exercise 8.2.12. Use the odd writhe to show that one of the Gauss diagrams 
in Figure 8.2.2 corresponds to a nonclassical knot. Is the other knot classical 
or not? 


Figure 8.2.2: Two virtual knots, given by their Gauss diagrams. 


We have seen that the odd writhe is a useful invariant, but there are limits to 
what it can achieve. Indeed, as you may have discovered, there are 
nonclassical virtual knots for which J(K) = 0. The invariant J only gives us 
partial information about which virtual knots are classical and which aren’t. 
So what are some other virtual knot invariants? Here, we mention one called 
the intersection index polynomial [7, 14]. To define the intersection index 
polynomial, we first need to define the intersection index of a single crossing 
in a virtual knot diagram. Then, we will use the intersection indices of all of 
the crossings in a virtual knot diagram to form a polynomial invariant for that 
virtual knot. 


For a given oriented virtual knot diagram D, let us choose a classical 


crossing, d, in this diagram and smooth the crossing, as pictured in Figure 
B23. 


Figure 8.2.3: Smoothing of a crossing with numbered components. 


XIEX 


This smoothing produces the diagram of a two-component virtual link. We 
label the components of the link as follows: when the crossing that is being 
smoothed is oriented upward, the component on the left is assigned the 
number 1 and the component on the right is assigned the number 2. Just as in 
the definition of the linking number of a classical link, we restrict our 
attention to crossings involving both components. So let C, denote the set of 
(classical) crossings in this virtual link diagram that involve both components 
of the link. We assign signs a(x) to each classical crossing x in C, as shown 
in Figure 8.2.4 based on which component of the link each strand of the 
crossing belongs to. (Note that this is a different way of assigning signs to 
crossings than we are used to!) 


Now we have the following definition. The intersection index of a crossing 
D ina virtual knot diagram K is the sum of the values a(x) for all classical 
crossings x in Cg. Let’s denote the intersection index of crossing D by i(d). 
See Figure 8.2.5 for an example illustrating how to compute the intersection 
index for each of the crossings in a given virtual knot diagram. 


Exercise 8.2.13. Find the intersection index of each classical crossing in the 
virtual knot diagram pictured in Figure 8.1.5. 


Figure 8.2.4: Definition of a(x) of a classical crossing x in Cj. 


KIX 
XIX 


Exercise 8.2.14. Find the intersection index of each classical crossing in the 
virtual knot diagram pictured in Figure 8.2.6. 


Z 


Using the idea of an intersection index, we define a polynomial invariant for 
virtual knots as follows. 


Definition 8.2.15. Let the intersection index polynomial, p,(K), for virtual 
knot K with diagram D be the sum over all classical crossings D in D of the 
polynomial sign(d)(t"! — 1). In more succinct notation: 


p(k) = >, sign(d) (tM! — 1), 


d 


In this formula, the quantity sign(d) is the usual sign of the crossing d (as in 
Definition 3.4.6), and the letter t is a variable. 


For the example in Figure 8.2.5, verify the following computation. 
pi(K) = sign(a)(t*! — 1) + sign(b)(t?! — 1) 
+ sign(c)(t*©! — 1) + sign(d)(t*! — 1) 


Qe -H2Oe™ =i1)+ He" -1)+ OE" =1) 
=t?+3t-4 


Exercise 8.2.16. Show that P,(T,,) = 2t — 2 for the virtual trefoil T, by 


computing the intersection index polynomial in two different ways, using the 
two equivalent, but distinct, virtual trefoil diagrams you created in Exercise 
8.1.6. 


Figure 8.2.5: Computing the intersection index for all crossings in a virtual 
knot diagram. 


i(a) =3 i(b) =-1 
i(c) =-1 i(d) =-1 


Exercise 8.2.17. Find p,(K) for the virtual knot K shown in Figure 8.1.5. 


Exercise 8.2.18. Find p,(K) for the virtual knot K shown in Figure 8.2.6. 


What is J (K) for this virtual knot? 


Exercise 8.2.19. Prove that p,(K) does not depend on the virtual knot diagram 
you choose to represent K. In other words, p,({K) is a virtual knot invariant. 


(Hint: Proceed with your proof by determining the effect of each classical and 
virtual R-move on the value of the polynomial. In particular, explain why the 
—1 term is essential for invariance of p,(K).) 


Figure 8.2.6: Another example of a virtual knot. 


(~ 


The intersection index polynomial has several interesting properties, as we 
see in Theorem 8.2.20. 


Theorem 8.2.20. 1. The invariantp,(K) = 0 if K is a classical knot. 


2. The invariant pK) is strictly stronger than J, meaning that if J can be 
used to prove that two virtual knots K and K' aren’t equivalent (by 
showing J(K) 4 J(K’), so can p,(K) (by showing P,(K) # P,(K’)). 
Furthermore, there are nonequivalent virtual knots that pK) can 
distinguish that J can’t. (See Exercise 8.2.18.) 


The first statement in Theorem 8.2.20 can be proven by using the famous 


Jordan Curve Theorem to show that the intersection index of any crossing in 
a Classical knot diagram is 0, but let’s focus on the second statement. Can you 
prove it? 


Exercise 8.2.21. Prove statement 2 in Theorem 8.2.20. (Hint: Consider the 
terms corresponding to odd crossings in p,(K).) 


In this section, we have defined several virtual knot invariants that give us 
information about the “virtualness” of nonclassical virtual knots, but that are 
uninteresting for classical knots. There are many more virtual knot invariants 
that have been studied that not only distinguish nonclassical virtuals, but that 
can also distinguish classical knots from one another. For instance, there are 
several ways to extend the Jones polynomial so that it is an invariant of 
virtual knots but it agrees with the Jones polynomial for all classical knots. 
See [10, 21] for various virtual enhancements of the Jones polynomial. Other 
interesting virtual knot invariants can be found in [11, 21, 34] and elsewhere. 


8.3. Virtual Unknotting 


In Chapter 7, we delved into some unknotting operations for classical knots. 
In particular, we learned about the crossing change (CC) move that can be 
used to turn any knot diagram into a diagram of the unknot. Perhaps 
surprisingly, the CC move is not an unknotting operation for virtual knots. 
We can show this using one of our favorite examples. 


Exercise 8.3.1. Consider the virtual knot diagram D in Figure 8.2.5. We 
computed p,(K) for this virtual knot using D and saw that p,(K) ~ 0, thus 


proving that the virtual knot is nonclassical. In particular, we proved that the 
virtual knot is not the unknot. Show that, for every possible virtual knot 
diagram D’ that is related to D by some number of CC moves, the intersection 
index polynomial is nonzero. Thus, D cannot be unknotted using CC moves. 


Another famous example of a nonclassical virtual knot that cannot be 
unknotted with CC moves is Kishino’s knot, pictured in Figure 8.3.1 and 
first introduced in [24]. Unlike the example in Figure 8.2.5, more powerful 
virtual knot invariants than J or p, are needed to prove that Kishino’s knot is 


nontrivial. (See, for instance, [4, 10].) 


Figure 8.3.1: Kishino’s knot. 


Exercise 8.3.2. Prove that neither the odd writhe nor the intersection index 
polynomial can distinguish Kishino’s knot from the unknot. 


If we take a closer look at Kishino’s knot, we see that the reason this virtual 
knot diagram is nontrivially and virtually knotted is a consequence of the fact 
that certain Reidemeister-type moves are forbidden. In Exercise 8.1.7, you 
showed that four of the potential virtual Reidemeister moves fail to describe a 
virtual knot equivalence. Two of these moves are particularly interesting: 
moves (e) and (f). These two moves are commonly referred to as the 
forbidden moves. Notice that if either of the forbidden moves were 
allowable as virtual R-moves, Kishino’s knot would be unknottable. 


Exercise 8.3.3. Show that Kishino’s knot can be unknotted using virtual 
Reidemeister moves together with one or both of the forbidden moves, i.e., 
moves (e) and (f) in Figure 8.1.7. 


Since we are able to unknot Kishino’s knot using forbidden moves, we might 
ask, are there other virtual knots that can be unknotted with forbidden 
moves? The answer is a resounding “Yes!” Sam Nelson proved the following 
theorem in [29]. 


Theorem 8.3.4. If both of the forbidden moves are added to the collection of 
classical and virtual Reidemeister moves, then every virtual (and classical) 


knot can be unknotted. 


The way Nelson proved this fascinating result is actually quite simple. He 


showed that the Gauss diagrammatic forbidden moves along with Gauss 
diagrammatic Reidemeister moves can be used to rearrange all arrows in any 
Gauss diagram so that they become nonintersecting. As you proved in 
Exercise 4.2.7, such a Gauss diagram must represent an unknot. Let’s 
reconstruct the details of Nelson’s proof. 


Exercise 8.3.5. Use the Gauss diagram schema you derived in Exercise 8.1.7 
for forbidden moves (e) and (f) in Figure 8.1.7 along with Gauss 
diagrammatic R-moves from Figures 4.2.5, 4.2.4, and 8.3.2 to justify each of 
the moves in the Gauss diagram sequence shown in Figure 8.3.3. In other 
words, identify moves i, ii, iii, iv, and v if n is replaced by + and — n is 
replaced by —. Similarly, identify moves i, ii, iii, iv, and v if n is replaced by — 
and — n is replaced by +. 


Exercise 8.3.6. In Exercise 8.3.5, Gauss diagrammatic R-moves and 
forbidden moves were used to move the head of an arrow past the tail of an 
arrow in a Gauss diagram when the arrows have the same sign. Now, prove 
that the head of an arrow can be moved past the tail of an arrow that has the 
opposite sign (as shown in Figure 8.3.4) by providing a sequence of Gauss 
diagrammatic R-moves and forbidden moves similar to the one shown in 
Figure 8.3.3. 


Before we put all of the pieces together and complete our proof, let’s see how 
these derived forbidden moves are able to unknot several nontrivial virtual 
knots. 


Figure 8.3.2: Four Gauss diagrammatic Reidemeister 3 moves that do not 
appear in Figure 4.2.5, but can be derived from the minimal generating set 
shown in the figure. (Did you find all four of these moves in Exercise 4.2.9?) 


Figure 8.3.3: A sequence of Gauss diagrammatic Reidemeister and forbidden 
moves. 


--. 


Exercise 8.3.7. Unknot the two nontrivial virtual knots in Figure 8.1.3 using 
the derived forbidden moves from Figures 8.3.3 and 8.3.4. 


Exercise 8.3.8. Use Exercises 4.2.7, 8.3.5, and 8.3.6 to prove Theorem 8.3.4. 


It is clear now why we don’t allow forbidden moves to be used to 
characterize virtual knot equivalence. Curiously, if just one of the two 
forbidden moves is added to the list of classical and virtual R-moves, the 
resulting knot theory isn’t trivial. (Recall that we needed both forbidden 


moves in order to complete our proof above.) In fact, this new kind of knot 
theory is quite interesting! It is referred to as the theory of welded knots [18]. 


Figure 8.3.4: A move that can be derived from a sequence of Gauss 
diagrammatic Reidemeister and forbidden moves. 


We just showed that the pair of forbidden moves can unknot any virtual knot. 
Just as in classical knot theory, there are many types of unknotting 
operations. Let’s look at a different flavor of unknotting operation for virtual 
knots called virtualization. When a classical crossing in a virtual knot 
diagram is replaced by a virtual crossing, as in Figure 8.3.5, we call this a 
virtualization of the crossing. 


Figure 8.3.5: A classical crossing becomes a virtual crossing via the 
virtualization operation. 


—> 


In Exercises 8.1.4 and 8.1.5, we proved (perhaps without realizing it) that 


virtualization is an unknotting operation. Indeed, if all or nearly all of the 
classical crossings in a virtual knot diagram are virtualized, then the diagram 
becomes the unknot. An interesting question emerges from this idea: can we 
unknot a given virtual knot diagram with fewer virtualization moves? 


Exercise 8.3.9. Perform some number n < 3 of virtualization moves on the 
diagram of Kishino’s knot in Figure 8.3.1 to transform the diagram into an 
unknot. Show, using virtual R-moves, that your virtualized diagram is indeed 
the unknot. 


Given that the virtualization move is a virtual unknotting operation, we can 
define the virtual unknotting number of a virtual knot K. 


Definition 8.3.10. The virtual unknotting number, u,(K), of K is the 


minimum number of virtualization moves that must be performed on a 
diagram D of K in order to produce the unknot. Here, the minimum is taken 
over all possible diagrams D of K. 


Exercise 8.3.11. Make a conjecture about the value of u,(K) for the virtual 
knot K shown in Figure 8.1.5. 


Exercise 8.3.12. Provide an infinite family of virtual knots with virtual 
unknotting number 1. (Hint: There are two parts to this problem. First, use 
the odd writhe or the intersection index polynomial to prove that all of the 
virtual knots in your family are distinct. Second, prove that each member of 
the family does indeed have virtual unknotting number 1.) 


Finally, let’s think about the virtual unknotting number more generally. 


Exercise 8.3.13. In Section 7.2, we defined the trivializing number t (D) of a 
knot diagram D via its Gauss diagram Gp. Since t(D) is defined in terms of 
Gauss diagrams, its definition can be extended to virtual knots. Use the 
trivializing number to find an upper bound on the virtual unknotting number 
of a virtual knot. 
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1 Introduction 


Is the “cable spaghetti” on the floor really knotted 
or is it enough to pull on both ends of the wire 
to completely unfold it? 


Since its invention, knot theory has always been a place of interdisciplinarity. The first 
knot tables composed by Tait have been motivated by Lord Kelvin’s theory of atoms. 
But it was not until a century ago that tools for rigorously distinguishing the trefoil 
and its mirror image as members of two different knot classes were derived. In the 
first half of the twentieth century, knot theory seemed to be a place mainly driven by 
algebraic and combinatorial arguments, mentioning the contributions of Alexander, 
Reidemeister, Seifert, Schubert, and many others. Besides the development of higher 
dimensional knot theory, the search for new knot invariants has been a major endeav- 
our since about 1960. 

At the same time, connections to applications in DNA biology and statistical 
physics have surfaced. DNA biology had made a huge progress and it was well un- 
derstood that the topology of DNA strands matters: modeling of the interplay between 
molecules and enzymes such as topoisomerases necessarily involves notions of ‘knot- 
tedness’. 

Any configuration involving long strands or flexible ropes with a relatively small 
diameter leads to a mathematical model in terms of curves. Therefore knots appear 
almost naturally in this context and require techniques from algebra, (differential) 
geometry, analysis, combinatorics, and computational mathematics. 

The discovery of the Jones polynomial in 1984 has led to the great popularity 
of knot theory not only amongst mathematicians and stimulated many activities in 
this direction. Many deep connections between the Jones polynomial and quantum 
physics were found. In the 1990s, monographs of Adams, Kauffman, and Livingston 
addressed a wide audience of mathematicians, from undergraduate students to re- 
search professionals, disseminated new results and ideas and thereby added to the 
perception that knot theory is a vivid discipline having impact far beyond its roots. 

Today we find that knots appear in almost all mathematical disciplines, having 
important applications in the sciences and, most importantly, that concepts from one 
field have impact on problems in other areas. Even more, we will see that applica- 
tions do not only make use of existing theories developed in entirely theoretical frame- 
works, but that questions from the sciences also stimulate theoretical developments 
in turn. 


In this edition we focus on four aspects that thematically contour its basis and back- 
ground and these will be outlined below. Of course, this choice is not meant to be ex- 
haustive, e.g., we do not cover recent developments in the theory of low-dimensional 
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topology and restrict ourselves to one of the most thrilling and active parts of algebraic 
knot theory: Khovanov homology. 


Applications in the sciences 


As already pointed out, the initial development of knot theory has been driven by 
physical motivation. In the second half of the last century, many additional fields of 
application have emerged. We will illustrate this variety of them by commenting on 
some examples. 

At the beginning of the 1960s, it was realized that many molecules consist of long 
polymeric chains. The general aim is to investigate their structure and the relationship 
of the observed structures to biological features, their formation, and function. In this 
context, the question about the likeliness of knots in these structures arose almost 
naturally. 

In this time, Frisch and Wasserman as well as Delbriick stated that ring polymers 
would be knotted with probability increasing to one as their length tends to infin- 
ity. This conjecture stimulated the study of polygonal knots and could be settled only 
about thirty years later. 

An important point in this context is to identify and characterize “knotting” us- 
ing a topological concept that is applicable to open protein knots. Several approaches 
have been developed to achieve this goal. As polymers are often modeled by open poly- 
gons, there is a need to generalize quantities used for the characterization of closed 
curves, rods, and ropes (such as writhe, self-linking number, and linking number). In 
order to consider a large variety of possible states, random walks regarding polygons 
have successfully been investigated and are still a subject of recent activity. Certain 
knot invariants such as the Jones polynomial have been employed in algorithms to 
automatically determine the knot class of a randomly chosen polygon. 

In certain situations one can in fact work with closed curves. Since the 1960s it 
has been known that there exist some specific circular DNA molecules. In other cases 
the two sticky ends of the DNA are very close to each other, so modeling by a closed 
curve seems reasonable. 

According to experimental observations, molecules forming more complex knots 
migrate faster than those with less complex knots. The speed seems to be proportional 
to the average crossing number of the corresponding ideal knots. The latter ones min- 
imize the ropelength, i.e., the quotient of length over thickness of a given knot. While 
this definition seems to be conceptionally elementary, it becomes quite involved from 
an analytical point of view due to its non-smoothness. This is a major issue in geo- 
metric knot theory which will be addressed below. The average crossing number is a 
frequently used and studied functional in polymer topology and turned out to be a 
good measure of polymer compaction. 
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Long DNA chains are often packed compactly under extreme confining condi- 
tions. Although the general DNA packing mechanism still lacks a complete under- 
standing, there have been recent attempts to develop a corresponding DNA packing 
model and test whether the kind of knot spectrum observed in the experiments can 
be computationally reproduced by studying equilateral random polygons confined in 
a sphere. 

Knots and the concept of ropelength also appear in the context of mathematical 
physics, for instance in ideal magnetohydrodynamics, with potential applications in 
the study of astrophysical flows. Magnetic knots are tubular embeddings of the mag- 
netic field in some torus centered on a smooth oriented loop that is knotted in the fluid 
domain. They are driven by the Lorentz force to inflexion-free spiral knots and braids. 
The topological crossing number provides a lower bound on the minimum magnetic 
energy of knots. Combining results on magnetic energy relaxation and data on numer- 
ical knot tightening reveals relationships between ropelength and groundstate energy 
minima. 

Models of closed rods in elasticity theory exhibit knotted configurations when ap- 
plying a suitable amount of twist in the case where self-avoidance is not present. Im- 
posing repulsive forces leads to a variety of different shapes and bifurcation phenom- 
ena which has not been fully understood yet. 


Geometric knot theory 


In the 1990s, the notion of knot energies led to the definition of many functionals on 
a suitable space of knots, so calculus of variations came into play. Originating from 
an idea of Fukuhara, the general aim is to investigate geometric properties of a given 
knotted curve in order to gain information on its knot type. This new subfield emerged 
from the search of particularly “nice” shapes of knots. In a broader sense, it is part of 
geometric curvature energies which include geometric integrals measuring smooth- 
ness and bending for objects that a priori do not have to be smooth, also covering the 
higher-dimensional case. 

Moffatt speculated that the DNA molecule seeks to attain some minimum energy 
state. Up to now, the energy involved here has not been identified, but this idea is one 
of the main motivations to study repulsive functionals in geometric knot theory. There 
have been several attempts to employ self-avoiding energies for mathematical models 
in microbiology. 

In fact, knot energies basically model self-avoidance, i.e., the functionals blow up 
on embedded curves converging to a curve with a self-intersection. Following the neg- 
ative gradient flow of this functional should simultaneously “untangle” the curve and 
prevent it from leaving the ambient knot class. As long as there are no saddle points 
of the energy within the unknot class, any such flow is a candidate for a retraction of 
the unknot class onto the circles which exists due to the Smale conjecture. 
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In this context, questions of existence and regularity of minimizers are of great 
interest. The first knot energies defined on smooth curves by O’Hara (which contain as 
a special case the famous “Mobius energy”) provide several interesting features. The 
definition is quite straight-forward: one penalizes distant strands of the curve with a 
small euclidean distance. It was the seminal discovery due to Freedman, He, and Wang 
that the Mobius energy is invariant under the full group of Mobius transformations 
in R? which paved the way to proving deep theorems on existence and regularity of 
minimizers within certain knot classes essentially by geometric means. 

The conjecture that the Mobius energy of links is minimized by the stereographic 
projection of the standard Hopf link has been settled using the min-max theory of 
minimal surfaces which had been developed for proving the Willmore conjecture by 
Marques and Neves. There are still other open hypotheses concerning the Mébius en- 
ergy, for instance the non-existence of minimizers in composite knot classes as stated 
by Kusner and Sullivan. 

Ropelength, mentioned above, albeit conceptually being the most elementary 
knot energy one can think of, turns out to be analytically quite involved due to taking 
suprema which is a non-smooth operation. The existence of ideal knots (ropelength 
minimizers within a given knot class) has been established independently by several 
research groups. The problem of ropelength criticality is quite hard—due to the lack of 
an explicit analytical characterization of the shape of a (non-trivial) ideal knot, many 
contributions focus on discretization and numerical visualization. 

Analytically, the situation becomes more accessible when suprema in the defini- 
tion of ropelength are replaced by integration. However, the regularity properties of 
these (families of) functionals still lack complete understanding. Additionally, several 
generalizations to higher dimensions have been studied. Especially the investigation 
of the integral Menger curvature reveals interesting connections to deep questions in 
complex analysis such as Painlevé’s problem. 

Up to now, gradient flows have been analyzed mainly for a sub-family of O’Hara’s 
energies. Albeit many numerical schemes rely on suitable gradient-based discretiza- 
tions, there is still a lack of a rigorous numerical analysis. 

There is a close connection between the geometric setup in knot theory and re- 
cently developed methods from analysis of non-local critical partial differential equa- 
tions. More precisely, techniques in the setup of so-called fractional harmonic maps 
could be extended to the Mobius energy to prove regularity for critical points. This was 
quite surprising, and it opens up a new link from knot theory to classical geometric 
analysis, making available the well-studied techniques from harmonic maps and har- 
monic analysis. This is an active area of research, with the hope of eventually being 
able to solve open problems with the well-developed tools from classical geometric 
and harmonic analysis, where purely knot-related geometric arguments fail. 
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Connections to differential geometry 


A classical example for the impact of knottedness to the geometry of curves is the 
Fary—Milnor theorem: The total curvature of a curve is bounded from below in terms 
of the bridge number. Consequently, any curve having total curvature less than 47 
must be unknotted. 

A more subtle consequence of knottedness is the existence of quadrisecants, i.e., 
straight lines intersecting the curve at least four times, which has been established by 
Pannwitz (1933). Surprisingly, there are many of them. Denne sharpened this result a 
couple of years ago to stating that a (tame) knotted curve even possesses an alternat- 
ing quadrisecant. In fact, quadrisecants greatly improve the known lower bounds on 
ropelength. 

In the same spirit, configuration spaces come into play when proving that Jordan 
curves of finite total curvature (without cusps) have inscribed squares. 


Combinatorial knot theory 


Classically, a knot invariant is a functional f mapping a given knot k to some object f(k) 
with the property that f(k,) = f(k2) if the knots k,, ky belong to the same knot class. 
For instance, we have f(k) € {yes, no} in case of tricolorability, and the fundamental 
group of the complement 7; (S?\k) is another knot invariant. Up to now, there is still no 
easily computable knot invariant that allows to distinguish any two given knot classes. 
Therefore, in order to capture features of different knot types, it is of great interest to 
define and investigate new knot invariants. 

They all contribute to a sort of big ‘tool box’ which can be applied to further char- 
acterize knot classes and add valuable pieces to the ‘puzzle’ that knot theory some- 
times consists of. 

Although the definition may only involve quite elementary geometric means, the 
corresponding theory turns out to be rather challenging. This applies, e.g., to the stick 
index which is the least number of line segments required to build the knot in space. 
A related quantity is the cubic lattice stick index. Another example is the arc index, 
the minimum number of pages required to present a given knot type, where an arc 
presentation is an embedding k of a knot or link in finitely many pages of an “open- 
book” so that each of its pages meets k in a single simple arc. This particular invariant 
has a long history dating back to Brunn who looked for a projection of a given link 
with a single singular point of higher multiplicity. 

Sometimes even the definition of a knot invariant may demand deep algebraic 
techniques. In the early 1980s, Jones established a connection to statistical mechanics. 
A system of particles where only neighbors interact can be modeled by a lattice or, 
more generally, a graph. If the partition function, the sum over all possible states of 
this system, can be chosen to respect Reidemeister moves, it becomes a knot invariant. 
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An example is the Potts model which is intimately related to knot theory, more 
precisely to the Jones polynomial and to the Khovanov homology, providing a link be- 
tween algebraic topology and theoretical physics. The Khovanov homology is a rather 
new knot invariant introduced by Khovanov in the 1990s and can be obtained from 
categorification of the Jones polynomial. 

Exciting physical interpretation and connections of Khovanov homology using 
avant-garde ideas from for example quantum fields and strings have already been 
found. And many new exciting developments in this directions are still to be expected. 
Or, as Edward Witten put it in a talk on knot and quantum theory at the Institute of 
Advances Study in October 2011: “Probably the full story involves physics ideas that 
we do not understand yet.” 


The aim of this book is to present short surveys and open questions on different ac- 
tive research directions related to knot theory in the broadest sense that are thought 
of as invitation and challenge to junior and senior researchers. In this sense, it com- 
plements the existing introducing (e.g., the above-mentioned monographs) and more 
advanced textbooks (like Crowell—Fox, Rolfsen, Burde—Zieschang, and many others). 

On the other hand, this collection emphasizes connections between the different 
fields and provides a prototypical example for the unity of mathematics. In this light, 
distinguishing between “pure” and “applied” fields does not seem to be a meaningful 
category. 

Joining certain aspects of knot theory, this presentation shall provide an overview 
on the state of the art, popularize open problems, and reveal synergies and future 
directions. 

The fields of modern knot theory use quite different tools. However, each such 
area is deeply grounded on results from the others, so new developments will certainly 
foster prospective research. To bring together those various aspects seems to us likea 
great opportunity for all sides. 

This edition is intended as a platform to exchange and update on ideas, open prob- 
lems, and interesting directions on which future collaborations over the borders of 
areas can be based. In the past, many of the contributing authors have successfully 
collaborated in different formations which adds to the flexibility and vitality of this 
area. 

We would be glad if this collection would further stimulate research activities and 
help disseminate ideas between different areas related to knots and knotted struc- 
tures. 


We are very much indebted to all authors who accepted the challenge of presenting 
the state of the art of their field of expertise to a broader readership as well as to our 
referees for their thorough work. 

This edition has been initiated on the occasion of the workshop Geometric Ener- 
gies with Links to Applications, Topology, and Open Problems which has been held in 
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Pawet Strzelecki and Heiko von der Mosel 
Geometric curvature energies: facts, trends, 
and open problems 


Abstract: This survey focuses on geometric curvature functionals, that is, geo- 
metrically defined self-avoidance energies for curves, surfaces, or more general k- 
dimensional sets in R@. Previous investigations of the authors and collaborators con- 
centrated on the regularising effects of such energies, with a priori estimates in the 
regime above scale-invariance that allowed for compactness and variational appli- 
cations for knotted curves and surfaces under topological restrictions. We briefly de- 
scribe the impact of geometric curvature energies on geometric knot theory. Currently, 
various attempts are being made to obtain a deeper understanding of the energy land- 
scape of these highly singular and nonlinear nonlocal interaction energies. Moreover, 
a regularity theory for critical points is being developed in the setting of fractional 
Sobolev spaces. We describe some of these current trends and present a list of open 
problems. 


Keywords: geometric curvature energies, singular integrals, critical points, regularity 
theory, geometric knot theory, elastic knots, rectifiability 


2.1 Facts 


Energies. Geometric curvature functionals are characterised as geometrically defined 
energies on a priorily non-smooth k-dimensional subsets 2" of R¢, and these func- 
tionals are designed to penalise self-intersections. In addition, there are regularising 
effects: finite energy implies some higher degree of smoothness of 2. One of the first 
examples is that of the Mébius energy on rectifiable curves 7 c R4, introduced by J. 
O’Hara [82] and investigated analytically by M. H. Freedman et al. [41, 50], 


40 | [gaa gegen) 
yoy 

where d(x, y) denotes the intrinsic distance between the points x, y on the curve 7, 

and .#! stands for the one-dimensional Hausdorff-measure. The first summand in the 

integrand resembles a Coulomb-type repulsive potential suitably regularised by the 

second term so as to obtain finite energy for smooth embedded curves. Another exam- 
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ple is that of ropelength”! defined as the quotient of length “(y7) and thickness A[7] of 
curves 1, where the latter is a non-smooth functional introduced by O. Gonzalez and 
J. H. Maddocks [48], defined as 


Aly] := inf R(x, y, Z). (2.1.2) 
Retain 


Here, R(x, y, z) denotes the circumcircle radius of the three curve points x, y, and z. 
Thickness can be regarded as a hard core potential or steric constraint, in contrast to 
repulsive potentials. 

Both functionals have had their impact on the modeling of macromolecules such 
as DNA and proteins [25, 76, 77], and on geometric knot theory, where one studies 
relations between the geometry of space curves and the knot types they represent. 
Several geometric curvature energies can be minimised within given knot classes to 
obtain particularly nice representatives of that knot, for example ideal knots as ro- 
pelength minimisers [29, 49]. Bounds on the energy sometimes imply bounds on knot 
invariants like stick number or crossing number, hence bounds on the number of knot 
classes that possess representatives below these energy thresholds; see, e.g., [24, 73]. 

However, both these extreme forms of energies have serious drawbacks. The 
highly singular integrals involved in the definition of any kind of repulsive potential 
like (2.1.1) need some sort of regularisation, and — besides the ambiguity in the choice 
of such a regularisation — it is by no means clear how to generalise this concept to 
higher dimensional objects. The steric constraint of given thickness (2.1.2) or the rope- 
length functional, on the other hand, is a non-smooth quantity imposing challenging 
technical problems, e.g., for the derivation and analysis of variational equations. This 
led to our systematic research between 2007 and 2012, devoted to a whole range of 
intermediate energies on curves and surfaces interpolating in some sense between 
hard steric constraints and “soft” repulsive potentials. Examples of such energies on 
one-dimensional sets include [119] 


= 1 1 
Uply) = / SP ayy iew Pel (2.1.3) 
tw eee 
or the double integral [114] 
1 1 
Fl) := i | sup Wey DOW, p22, tM) 


zx fy #Z 


2.1 The name of that functional is coined after the following geometric variational problem: given a 
rope of fixed constant thickness, what is the minimum length of this rope required to tie a given knot? 
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and also integral Menger” curvature [115] 


= 1 1 1 1 _ 
Mp y) = | / / Payne (dH (y)dH#'(z), p>3. (2.1.5) 


op ieee fae 


On a fixed loop + of unit length, these energies are ordered as 


My!?(y) < Jy"? (y) < Up!P() < a (2.1.6) 


ly? 
where the last term is the ropelength of +7. Moreover, the p-th root of Zp, -%, and of 
Mp tends to ropelength as p — oe both on fixed conformations of knots and in the 
sense of I’-convergence. 
Besides averaging and maximising over the multi-point interactions in the circum- 
radius we investigated tangent-point interactions such as [121, 57] 


= 1 1 1 : 
p(y) := / / mage (JdxH"(y), p>2, (2.1.7) 
yy 


as well, where rtp(x, y) is the radius of the unique circle through two given curve points 
x and y that is additionally tangent to y in x. 

We also introduced and studied geometric curvature energies on higher- 
dimensional sets such as thickness for surfaces © C R? [118, 117], where one minimises 
over all pairs of points x, y € 2 the tangent-point radius Rtp(x, y) of the smallest sphere 
through x and y that is tangent to ¥ in x. Later, we investigated integral Menger curva- 
ture for surfaces © c R? [120], 


PACE / i: if / K(x, y, 2 I-00 EVA (DAHA), p28, (2.1.8) 


rorr s 
where the integrand X is defined?? on tetrahedra T = (x, y,z, €) with vertices 
xX, y,z,€ €X,as 
ys 
K(T) = K(x, y,2,€):=2 2) —_ (2.1.9) 
area (T)diam “(T) 


2.2 Karl Menger considered in the 1930’s the circumradius R(x, y, z) of three curve points x, y, z € 
knowing that the coalescent limit of R(x, y, z) as x and y tend to z coincides with the local radius 
of curvature if the curve + is sufficiently smooth. Menger was also aware of the fact that there is an 
elementary formula for the circumradius solely in terms of the mutual distances of the points x, y, 
and z. By means of multipoint functions such as R(., -, -) Menger indeed intended to develop a purely 
metric geometry in contrast to classic differential geometry. The idea of using Menger curvature as a 
tool — both in harmonic analysis and in modeling — has been re-discovered in the last 20 years. 

2.3 The most obvious choice to take as integrand in (2.1.8) a negative power of the circumsphere radius 
of a tetrahedron does not serve our purposes since there are smooth embedded surfaces for which 
such an integrand would not be bounded; see our detailed discussion on various integrands in [120, 
Appendix B]. 
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Here .#3(T) is the volume of the tetrahedron T and area (T) the sum of its facet areas. 
For k-dimensional submanifolds 2* c R? we looked at the tangent-point energy [123] 


_ 1 k k 
ép(2) : i | RPO) Ga d™(x)\dH#"(y), (2.1.10) 


and S. Kolasifski investigated in his Ph.D. thesis integral Menger curvature for 3* c R@ 
[61, 62] 


Mie ‘) su yf KP (X05 ewe» Xp) IK (Xo) © AIK (put) (2.1.11) 
ey 


k + 2 integrals 


for p > k(k + 2), where the integrand generalises (and simplifies) the one given in 


(2.1.9) to 


k+1 
KD) = KOojsserXeea) = Gmatyes (2.1.12) 


for (k + 1)-dimensional simplices T = (xo, ..., X41) with each vertex x; on 2. 


Regularising effects. Summarising the essential results of this systematic re- 
search (which is well documented in a number of publications [119, 114, 115, 121, 117, 
118, 120, 123, 61, 62], we can say the following: we have a pretty clear understanding of 
the topological and regularising effects of each of these energies, with sharp regular- 
ity statements and uniform a priori estimates. For example, a rectifiable curve y with 
finite integral Menger curvature .%)(y) for some p > 3 (i.e., above the scale-invariant 
case p = 3) is homeomorphic to the unit-circle or unit-interval, and the arclength 
parametrisation of that curve satisfies the uniform a priori estimate 


ly’(s)- (| < Gols, t))/?|s - t+ 3? forall s, t. (2.1.13) 


In other words, y € C1!-G/?), so although .% does not capture the pointwise value 
of local curvature (which may be simply undefined even if .%)(v7) is finite), it does 
capture the average oscillation of the unit tangent vector; see [115, Theorem 1.2]. We 
may interpret this result as a geometric Morrey-Sobolev embedding: the integrand cor- 
responds to a very weak form of curvature integrated to some power p > 3, and the 
total domain of integration is three-dimensional; the classic Morrey-Sobolev theorem 
applied to second derivatives (instead of curvature) would give exactly the optimal 
HGlder exponent 1 — (3/p) for the first derivatives. 

Likewise for higher-dimensional subsets of R?, exemplified by integral Menger 
curvature .%»(2) for two-dimensional surfaces in Euclidean 3-space, as defined in 
(2.1.8); see [120, Theorem 1.4]: if an admissible two-dimensional set © c R? satis- 
fies M@p(2) < co for some p > 8 (again above the scale-invariant case p = 8), then 
>» is actually an orientable C!!~°/?)-submanifold with a controlled local graph rep- 
resentation: There is a uniform radius R > O depending only on p and the energy 
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value .%p(2), such that for each x € ¥ the intersection Br(x) NZ equals the graph ofa 
c1-1-8/P).function with uniform estimates on this function solely depending on p and 
My(x). Again, this is a geometric variant of the Morrey-Sobolev embedding theorem 
with optimal Hdlder exponent 1 — (8/p) for the oscillation of tangent planes. Similar 
results hold for k-dimensional admissible sets X* c R? with finite integral Menger 
curvature .%,(2) for p > k(k + 2), or finite tangent point energy 4(2) for p > 2k, 
both in the regime above scale-invariance; see [61, 62, 123]. A thorough discussion of 
the respective admissibility class of sets can be found, e.g., in [64]. At this point, we 
may roughly describe our mild requirements on the set >“ as a certain degree of local 
flatness around many (but not all) points, together with an amount of connectivity to 
allow for some degree-theoretic arguments. 

That these regularity estimates are indeed sharp, can be seen either by explicit 
examples constructed in [66, 57], or by the complete characterisation of energy spaces 
for all these energies in the work of S. Blatt and Kolasinski [17, 13, 11]: based on our 
results that finite energy implies that the admissible sets are already C!-submanifolds 
in R4, they use the explicit structure of the energies to estimate locally the seminorms 
of fractional Sobolev spaces to find that X has finite energy if and only if x is embedded 
and has local graph representations of exactly that Sobolev regularity. Recall, e.g., 
from [128, Section 2.2.2], that a function u € L?(R*) belongs to the Sobolev-Slobodeckii 
space W™*S-P(R*) for some m € N, s € (0, 1), and p € [1, oe) ifu belongs to the classic 
Sobolev space W”? and satisfies, in addition, 


Peay yP |D“u(x) - D*u(y) |? 
ullPresip = Wel emo) + So J/ ope aydx<ee, 14) 


|a|=mpx Rk 


As an example, let us mention Blatt’s and Kolasinski’s characterisation of the energy 
space for integral Menger curvature as defined in (2.1.11); see [17, Corollary 1.2]: If 
p > k(k + 2) and =* Cc R24 is an admissible set, then its integral Menger curvature 
My)(2) is finite if and only if X is a submanifold with local graph representation of class 
wi*sPcRK Rd-*), wheres = 1-(k(k+1)/p) € (0, 1). Blatt and Kolasifiski treated also 
all intermediate energies where up to all but two integrations in (2.1.11) are replaced 
by maximisations, obtaining corresponding fractional Sobolev spaces as the exact en- 
ergy spaces with suitably adapted differentiability and integrability. The missing cases 
where only one integration is left, i-e., 


t sup KP (x0, X15 0+ + Xk) dH" (x0), (2.1.15) 
x1 


generalising (2.1.3) to k-dimensional sets X* c R@, and the global tangent-point energy 
(cf. (2.1.10)), 


[ow oo d.#*(x), (2.1.16) 
J yer Rip Y) 
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were treated in cooperation with Kolasifski in [64] leading to the theorem that fi- 
nite energy characterises embedded submanifolds of classic Sobolev regularity W?? 
if p > k; see [64, Theorem 1.4]. This result may be compared to Allard’s famous C)?- 
regularity theorem [3, 37] for k-dimensional varifolds whose generalised mean curva- 
ture is p-integrable, where, again, p > k. 


Connections to geometric knot theory. The uniform estimates obtained in 
[119, 114, 115] for finite energy curves + (like the one in (2.1.13)) together with a uni- 
form geometric rigidity of these curves (replacing the excluded volume constraint of 
thickness) was used to connect the respective energies to geometric knot theory, as de- 
scribed in detail in [116, Section 4]; see also the recent surveys [122, 124]. This geometric 
tigidity** means, roughly speaking, that the curve may be equipped with a necklace 
of consecutive double-cones whose size and opening angle are determined purely in 
terms of the respective energy [116, Proposition 4.7]. The circular cross-sections of each 
piece of this necklace, i.e., of each such double cone (with its two tips located on the 
curve +), are intersected by y transversally and exactly in one point; see Figure 2.1. 
Once this necklace is established one can fairly easily construct an ambient isotopy 
from ¥ to the inscribed polygon made of the consecutive double cones’ axes. 

Thus, any one of the geometric curvature energies for curves (2.1.3)—(2.1.5), (2.1.7), 
bounds the stick number, which is the minimal number of straight segments you need 
to build a polygonal representative of the same knot type. Since stick number is a knot 
invariant, any such energy bounds the number of knot types: given any constant E = 0 
there is a nonnegative integer N(E) depending only on E, such that at most N(E) knot 
types can be represented by curves of geometric curvature energy below the energy 
threshold E. 

On the other hand, the double-cone property described above also serves as a 
substitute of the excluded volume constraint given by finite ropelength. This allows 
us to control the average crossing number acn(7), where you count the number of self- 
intersections of every planar projection of the given curve y and then average over all 
directions of projections. Indeed, Freedman et al. derived in [41, Section 3] a double 
integral formula for acn(y), 


1 


1 

foe I's) x 7'(0) - (= (8) 

acn(y) = 7 7 / WOEEOW ds dt, (2.1.17) 
0 


0 


where x denotes the usual cross-product in R*. While the local interaction terms in 
that formula may be estimated by the local smoothness properties of a finite energy 
curve 7, one can follow the strategy of G. Buck and J. Simon [24] for curves of finite 
ropelength, to estimate the global interaction terms by estimating the volume of a 
spatial region necessary to fit in a maximally compactified curve 7. Only here, one 


2.4 Referred to as diamond property in [116]. 
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Fig. 2.1: Top: The curve ¥ is trapped in a conical region with two tips at x, y € + and does not me- 
ander back and forth: each cross section of the double-cones contain exactly one point of the curve. 
Bottom: Necklace made of such small double cones with vertices along the curve y that have pair- 
wise disjoint interiors. The polygonal curve joining the consecutive vertices of the cones is ambient 
isotopic to +. 


has to replace the excluded volume constraint by the double-cone condition, so that 
our constants are far from being optimal; see [116, Proposition 4.13]. Since the aver- 
age crossing number bounds the classic knot invariant crossing number, we thus have 
established another means to control the number of knot types below given energy 
thresholds. 

In addition, we could show that all these energies are charge and tight, which 
means that they blow up along sequences that converge to curves with self- 
intersections and also along sequences where one small knotted subarc pulls tight, 
i.e., vanishes in the limit. Being tight distinguishes these geometric curvature energies 
from the Mobius energy (2.1.1): O’Hara showed in [83, Theorem 3.1] that the Mobius 
energy does not prevent the pull-tight phenomenon. Moreover, % and .% could be 
shown to distinguish the knot from the unknot: there is a gap between the infimum 
over unknots and the infimum over non-trivially knotted curves. The infima of all these 
energies (in contrast, e.g., to the Mobius~° energy) are attained on each given knot 


2.5 The Mobius energy can, however, be minimised within prescribed prime (or irreducible) knot 
classes; see [41, Theorem 4.3]. 
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class. Freedman et al. [41] showed that the Mobius energy Gygp is uniquely minimised 
by the round circle, a knot energy with that property is called basic. A. Abrams et 
al. [1] extended that uniqueness result to a larger family of energies. Also ropelength 
is basic, and more generally, % as well; see [119, Lemma 7]. A more recent mono- 
tonicity formula** for compact free boundary surfaces by A. Volkmann [129, Section 
5] implies the same for the tangent-point energy &% (see (2.1.7)), and hence also .% 
by the arguments in [116, Proof of Cor. 3.7]. All these results resolve some of the open 
problems formulated in geometric knot theory, e.g. in [125, Section 2], or in [86, Chap- 
ter 8], and give some first insights into the presumably complicated energy landscape 
of these energies on knot space. Almost nothing is known about the actual shape of 
knotted energy minimisers, apart from the explicit continuous family of ideal links 
(minimising ropelength in fixed link classes) presented by J. Cantarella, R. B. Kusner, 
and J. M. Sullivan in [29], and necessary criticality conditions for ropelength minimis- 
ers [110, 26, 27]. Moreover, studying ideal knots in R* lead to the discovery of unique 
explicit solution families of longest (thick) ropes on the two-sphere by H. Gerlach and 
the second author; see [45] and the popular account in [44]. 


2.2 Trends and open problems 


Regularity. Higher regularity of local minimisers or critical points is only known ina 
few cases. Freedman et al. [41] used the Mébius-invariance of the MGbius energy Sysp 
defined in (2.1.1) to apply reflection arguments to show that local minimisers are of 
class C11, and they derived the Euler-Lagrange equation 


Emop(y + T 2 — Emen(y) 


bEmep(y, A) = lim (2.2.1) 


er iui hu) (y(u) - y(v)) - (AQ) - h(v)) 
=u ff ( Eun nq —v)/2 ) 
Mw) 
T=aWe oe” 


for injective and regular curves + and perturbations h both of class C11. Later Zh.- 
X. He [50] used this Euler-Lagrange equation to improve the regularity of local 4ysp- 
minimisers to C*°-smoothness; see also [98]. Quite recently, there has been consider- 
able progress through the work of Blatt, P. Reiter, and A. Schikorra, who established 
the following deep regularity result [22, Theorem I] in the correct fractional Sobolev 
space W?/?-? corresponding — according to Blatt’s earlier work [11] — to finite Mébius 
energy: 


2.6 Or alternatively, a secant map approach of Blatt reminiscent of an argument to prove the classic 
Fenchel inequality; see [129, Section 5]. 
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Theorem 2.2.1 (4ygp-critical points are smooth). Any arclength parametrised critical 
point ~ < W?/2- of the Mébius energy is C°°-smooth. 


It is remarkable that no use at all is made of the Mébius-invariance of Eysp to prove 
Theorem 2.2.1 in contrast to the previous work of Freedman et al. Here, one uses the 
Euler-Lagrange equation (suitably extended to the correct fractional Sobolev spaces) 
to first gain some additional regularity, i.e., a slightly higher integrability of the tan- 
gent [22, Theorem III], before a bootstrapping process can be started. One should point 
out that, similarly to other geometric equations like the variational equation for the 
Willmore functional, the Euler-Lagrange equation for Sjysp is in a sense critical, which 
requires some very intricate techniques that were developed in the context of frac- 
tional harmonic mappings [34, 33, 102, 101]. 

Somewhat less involved is the regularity proof for other members of O’Hara’s fam- 
ilies of repulsive potentials [82, 84, 85], namely for the energies £*, where a power 
a € (2,3) replaces the quadratic power in the denominators of (2.1.1). Blatt and Re- 
iter established the Fréchet-differentiability of &“ on the space of regular curves of 
finite energy, and proved C*°-smoothness for arclength parametrised critical points 
[19, Theorems 1.1 & 1.2]. 

Tocarry over this regularity program to critical points of geometric curvature ener- 
gies such as the tangent-point energy (2.1.7) or integral Menger curvature (2.1.5), Blatt 
and Reiter embedded those energies into larger two-parameter families of energies by 
decoupling the integrability exponent p into different powers for numerator and de- 
nominator of the integrands. In this way they obtain, for instance, modified tangent- 
point energies 


FQ / / a w\(x)d 0), (2.2.2) 
rP-(x, y) 
yey 
by replacing the p-th power of the inverse tangent-point radius 
: ; q 
— ust y) by the less geometric expression = dist (Tx, y) 
rep(x, y) Ix-y| r'P-9(x, y) |x - y|P 


(Here, T, denotes the tangent-line to 7 through the point x € ¥.) In the parameter 
regime g > landp ¢€ (q+ 2,2q + 1) the modified tangent-point energies turn out 
to be well-behaved knot energies that are minimisable in every knot class. The frac- 
tional Sobolev regularity W-))/4-4 characterises finite energy (see[20, Theorems 1.1 
& 1.3]), and allows a first variation formula even without Cauchy principal values [20, 
Theorem 1.4] in contrast to the variational equations of O’Hara’s repulsive energies. 
Blatt and Reiter then identify a non-degenerate parameter range gq = 2, p € (4,5) 
that permits a regularity result[20, Theorem 1.5]. Unfortunately this range excludes 
the original tangent-point energy é = 7???" (cf. (2.1.7). 


Theorem 2.2.2 (7.)-critical points aresmooth). For p € (4,5) any ZP-2)- 
critical arclength parametrised injective curve of class W-!?,? is C*°-smooth. 
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Asimilar result holds also for a subfamily of modified integral Menger energies; see [21], 
but apart from that non-degenerate parameter regime, in particular for (the original) 
integral Menger curvature (2.1.5) and tangent point energies (2.1.7) the regularity of 
critical points, even that of local minimisers remains open. 


Open questions 2.2.3. Despite degeneracies in the non-local Euler-Lagrange equa- 
tions, is there any chance to prove additional regularity (beyond the fractional Sobolev 
regularity characterising energy spaces) for local minimisers or critical points of tangent- 
point energies (2.1.7), or of integral Menger curvature (2.1.5)? Does the optimal regular- 
ity depend on p, and if yes, what happens with that p-dependent regularity in the limit 
p — ce? Does that lead to new insights into the still open optimal regularity of ideal 
knots? 


We know that the geometric curvature energies in higher dimensions such as inte- 
gral Menger curvature (2.1.8), (2.1.11), or the tangent-point energy (2.1.10) can be min- 
imised within given isotopy classes of submanifolds [117, Theorem 7.1], [120, Theorem 
1.7], [65, Corollary 1], but nothing is known about higher regularity of these minimis- 
ers, not to speak of a regularity statement about possible critical points. Not even a 
variational equation has been derived so far in higher dimensions. In case of the non- 
smooth energies (2.1.15), (2.1.16), and also for the one-dimensional prototype (2.1.3), 
non-smooth analysis tools such as Clarke gradients would have to be applied to de- 
rive the variational differential inclusion, similar to the analysis performed for the ro- 
pelength functional for curves involving the non-smooth expression (2.1.2) for thick- 
ness; see [109, 110, 26, 27]. 


Open questions 2.2.4. What are the Euler-Lagrange-equations for higher-dimensional 
geometric curvature energies like integral Menger curvature (2.1.5) or tangent-point en- 
ergies (2.1.10)? Is there any chance to prove higher regularity of local minimisers or crit- 
ical points of these energies? What form do the expected variational inclusions have for 
the non-smooth geometric energies (2.1.3), (2.1.15), (2.1.16)? What can be said about the 
regularity of thick knotted surfaces minimising area? 


Below or in the scale-invariant regime. Integral Menger curvature .Z on one- 
dimensional sets E c C with integrability exponent p = 2 (well below the scale- 
invariant exponent p = 3) has played a fundamental réle in harmonic analysis, e.g., 
in the proof of the famous Vitushkin conjecture on the removability of compact sub- 
sets of the complex plane for complex analytic functions; see, for instance, X. Tolsa’s 
quite recent excellent monograph [126]. Motivated by some of G. David’s methods [35] 
for his final proof of this conjecture, J.-C. Léger [70] proved the following remarkable 
rectifiability result: 
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Theorem 2.2.5 (Rectifiability for sets of finite integral Menger curvature .%). Any 
Borel set E Cc R" with 0 < #1(E) < satisfying 
ME) = i / i —) AK) dH yd) <0 (2.2.3) 
R2(x, y, 2) 
EE E 


is 1-rectifiable, i.e., there exists a countable family of Lipschitz functions f; : R + R" 
such that 71(E \U; fi(R)) = 0. 


So, even below scale-invariance, integral Menger curvature has regularising effects on 
sets. In our context of geometric curvature energies one is naturally lead to the ques- 
tion if one can generalise Leger’s deep result to sets of higher dimensions? What are 
suitable generalisations of the integrand in (2.2.3) which is defined on point triples 
forming two-dimensional simplices. Already for our generalisations to surfaces and 
submanifolds as given in (2.1.9) and (2.1.12) we had discussed several variants of in- 
tegrands defined on general (k + 1)-dimensional simplices; see, e.g., the introduction 
and appendix of [120]. Recently, M. Meurer [75] presented a collection of integrands 
for which Léger’s result could indeed be extended to arbitrary dimensions and co- 
dimensions, including, e.g., the integrand, defined on (k + 1)-dimensional simplices 
T = (Xo, X1,-+ +» Xk41)3 see [75, Section 3.2], 


I(T) 


diam (T)(+(k+2)/2’ (2.2.4) 


Ky (T) = Ku(xo, Seaness Xt) = 


which is one out of several possible generalisations of Léger’s integrand 1/R(x, y, Z) 
in (2.2.3). Meurer could prove the following rectifiability theorem [75, Theorem 1.1]. 


Theorem 2.2.6 (Rectifiability in arbitrary dimensions). Any Borel set E c R" withO < 
ICE) < 00 satisfying 


ME) = i x if Keepy ves Xpar) AHN %0) + dH" yu) < 00 (2.2.5) 
E E 


k + 2 integrals 


is k-rectifiable, i.e., can be covered (up to sets of #*-measure zero) by a countable union 
of Lipschitz images of R*. 


Meutrer’s class of admissible integrands includes also the discrete curvatures used by 
G. Lerman and J. T. Whitehouse in [71, 72] to give a characterisation of David’s and S. 
Semmes’ concept of uniform rectifiability; cf [36, Theorem 1.57]. J. Azzam and Tolsa 
[7] recently established a new rectifiability criterion in terms of P. Jones’s B-numbers 
[56] which are fundamentally related to integral Menger curvature as shown in [70, 
75]. Interestingly, however, and somewhat surprising is the fact, that the integrands 
(2.1.9) and (2.1.12) we studied in the integrability regime above scale-invariance, are 
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not included in Meurer’s class of integrands; they do scale differently. At this point it 
remains open, if one can replace Ky in (2.2.5) by the expression XK defined in (2.1.12). 

In the definition of rectifiability one covers the set (up to a set of measure zero) by 
Lipschitz images, and one might think about improving the regularity of the covering 
images. The step from Lipschitz to C!-images is immediate by Whitney’s extension 
theorem; see, e.g. [112, Section 3, Lemma 11.1], but improving that to C1“ (as in the 
regime above scale-invariance) is highly non-trivial. This was recently accomplished 
by Kolasifski [63] also for a large class of discrete curvatures with a certain overlap 
with Meurer’s class including (2.2.4), so that, e.g., the following higher order rectifia- 
bility result holds true and can be deduced from [63, Theorem 1.1]. 


Theorem 2.2.7 (C'*-rectifiability). Any Borel set E c R" with0 < #*(E) < co and 
a.e. positive lower density, satisfying 


Mie ae / KE (X05 66+» Xia) AICK(x0) + AI) <0 (2.2.6) 
E E 


k + 2 integrals 


for some p > 2 is k-rectifiable of class C's“ for some positive Hélder exponent a = a(p), 
i.e., the set E can be covered (up to sets of Jé*-measure zero) by a countable union of 
k-dimensional C!*-submanifolds of R". 


Open questions 2.2.8. Can one extend Meurer’s rectifiability result to the integrands 
(2.1.9) or (2.1.12) of integral Menger curvature or to the tangent-point energies (2.1.10) 
defined on a suitable wide class of non-smooth sets? How does Meurer’s result relate to 
other recent rectifiability results like [127, 7]? 


Not much is known about geometric curvature energies in the scale-invariant regime, 
but simple scaling arguments reveal the fact that cone-type singularities do lead to 
infinite geometric curvature energies; see Figure 2.2. S. Scholtes could indeed demon- 
strate that embedded polygons have finite integral Menger curvature .% if and only 
if p € (0, 3); see [103]. Recall that p = 3 is the scale-invariant exponent for integral 
Menger curvature for curves. In addition, Scholtes established certain weak tangential 
properties of arbitrary (a priori fairly wild) sets at every point if the one-dimensional 
set E C R" has finite integral Menger curvature .43(E) [104]. 

So, one can indeed hope for mild regularising effects, like for the energy % for 
curves for p = 1, where we proved in [119, Theorem 1] that finite 7-energy implies 
that the curve is embedded and in the Sobolev class W”’!. However, not every em- 
bedded W?:!-curve has finite U;-energy; see [119, Example pp. 120-121]. Finiteness of 
the tangent-point energy > in the scale-invariant case p = 2 (see definition (2.1.7)) 
yields at least a topological one-dimensional manifold — possibly with boundary; see 
[121, Theorem 1.1]. Only for the Mobius energy (2.1.1), whose Mébius-invariance implies 
scale-invariance, one has Blatt’s [11] characterisation of the appropriate energy spaces 
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Fig. 2.2: Acone has infinite scale-invariant geometric curvature &, since scaling of a fixed portion S 
of the cone leads to the same quantum of energy 4(S) = &(S/2) = &(S/4) =.... Adding up these 
infinitely many contributions £(S/2'), i € N leads to a divergent series of positive real numbers as a 
lower bound for the cone’s energy if (S/2') N (S/2/) = Ofori 4 j. 


as the fractional Sobolev space W?/?? (assuming injective arclength parametrised 
curves), and already earlier Blatt and Reiter used an idea of He to construct a closed bi- 
Lipschitz curve with finite Mobius energy that is not differentiable [18, Corollary 4.2]. 
But very recently, Blatt has established a nice approximation result on convolutions of 
curves whose tangents have vanishing mean oscillations which in particular implies 
that arclength parametrised curves of finite Mobius energy can be approximated in the 
w?/2-2_norm and in energy” by smooth curves; see [14, Theorem 1.3]. At present there 
are a few suggestions how to generalise the Mébius energy to higher-dimensional sub- 
manifolds — we are aware of Kusner and Sullivan [67, 68] and D. Auckly and L. Sadun 
[5] (see also the very recent contribution by O’Hara and G. Solanes [88], [87]) — but 
no satisfactory analysis regarding regularity or variational issues has been performed 
yet. 


2.7 This has various consequences in geometric knot theory, for instance, it completes Scholtes’ recent 
investigations on a discrete version of the Mébius energy for polygons with n edges, that can now be 
shown to I’-converge to the Mébius energy (2.1.1) as n — 9; see [105, Theorem 1.1] and [14, Theorem 
3.8]. We do not address the very interesting questions regarding suitable discretisations and merely 
refer to the work of Rawdon et al. [92, 93, 95, 78, 94, 96, 97, 106] on discretised versions of ropelength, 
and to [108, 105, 107] for discretisations of a few other geometric curvature energies. 
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The scale-invariant exponent for integral Menger curvature .%(2) on two- 
dimensional surfaces © c R? is p = 8, and we proved a Fenchel-type theorem [120, 
Theorem 1.2]. 


Theorem 2.2.9 (Fenchel-type theorem). There is an absolute constant yo > 0 such that 
(2) = yo for any closed compact connected two-dimensional Lipschitz surface X C 
R?. 


Due to our rather rough estimates the constant +o is far from being optimal. 


Open questions 2.2.10. What is the optimal regularity of finite energy Lipschitz curves 
or submanifolds for any of the geometric curvature energies in the scale-invariant case? 
How regular are submanifolds of finite energy for a suitable generalisation of the Mébius 
energy to higher dimensional objects? How much geometric curvature energy does one 
really need to close a curve or a surface, in other words what are the optimal constants 
in Fenchel-type theorems like Theorem 2.2.9? Is every k-dimensional manifold M* im- 
mersed in R" automatically embedded if its image has finite scale-invariant tangent- 
point energy &, (see (2.1.10)), or finite scale-invariant integral Menger curvature 44.7) 
(see (2.1.11))? 


Existence of critical points. For all geometric curvature energies above scale- 
invariance one can find (at least one) minimising knot in a given isotopy class. This 
even works for higher-dimensional geometric curvature energies such as integral 
Menger curvature or tangent-point energies for submanifolds as described above. But 
are there other critical points, and how can one prove their existence? One of the first 
attempts in that direction is the work of D. Kim and Kusner [59] on the MGbius energy. 
They applied R. S. Palais’ principle of symmetric criticality [89] to obtain Gygp-critical 
torus knots by minimising the Mobius energy within the appropriate subclass of torus 
knots enjoying particular symmetries . In addition, together with G. Stengle [59, p. 4] 
they used classic residue calculus from complex analysis to calculate their energy val- 
ues. Further numerical experiments lead them to conjecture that most of these Sysp- 
critical torus knots are not local minimisers. For the non-smooth ropelength func- 
tional Cantarella et al. [28] successfully modified Palais’ symmetric criticality prin- 
ciple to find new critical points in several symmetry classes of knots and links, e.g. 
in the non-trivial (a, b)-torus knots. They used their numerical ropelength minimis- 
ing algorithm RIDGE RUNNER to compute their respective values for ropelength; see 
Figure 2.3. 

In ongoing cooperative work with A. Gilsbach we apply Palais’ principle to O’Hara’s 
repulsive energies, integral Menger curvature, and tangent-point energies to produce 
symmetric critical configurations in every prescribed knot class. Specifically, in non- 
trivial (a, b)-torus knot classes we even obtain two distinct symmetric critical knots 
with this method [46], [47]. Very helpful in that context is the knowledge of the respec- 
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Fig. 2.3: A5-fold and a 2-fold symmetric ropelength-critical (2, 5)-torus knot, both with ropelength 
values distinctly larger than the unsymmetric global ropelength minimiser on the right accord- 
ing to the computations with RIDGE RUNNER of Cantarella et al. in [28]. (Images by courtesy of 

J. Cantarella.) 


tive correct energy spaces described in Section 2.1. Gilsbach also uses '-convergence 
arguments to show that her symmetric critical points of integral Menger curvature .Z, 
do converge to ropelength-critical points as p — oo. Recently, Gilsbach has modi- 
fied T. Hermes’ numerical code [52] to actually compute the energy values of the sym- 
metric critical points of integral Menger curvature. Hermes had rigorously derived the 
first variation formula for integral Menger curvature in the suitable fractional Sobolev 
space, and could prove that the round circle is a critical point. He created a numer- 
ical tool to explore the presumably quite complicated energy landscape of integral 
Menger curvature. His numerical experiments exhibit among other things the ability 
of the Menger gradient flow to untangle complicated unknots to the round circle after 
fairly short time, as well as varying features as p approaches infinity. For p only slightly 
above the scale-invariant exponent one finds smoothing as the predominant feature 
(while keeping the curves embedded in contrast to, e.g., the classic mean curvature 
flow on space curves), whereas for large p, say p = 50, the similarity to Cantarella’s 
RIDGE RUNNER (corresponding to the case p = os) is striking [4]: both flows try to em- 
bed the curves as nicely as possible. 

Asecond variation formula has been derived and analysed in detail for the M6bius 
energy by A. Ishizeki and T. Nagasawa [54]. They used a very interesting decompo- 
sition theorem for the Mébius energy itself [53], and studied recently the Mébius- 
invariance of the various parts of that decomposition [55]. Also quite recently J. Knapp- 
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The gradient flow for integral 
Menger curvature flows the ini- 
tial configuration above to the 
trefoil on the top right for p = 
3.5, and to the bottom trefoil on 
the right as p = 50. 


Fig. 2.4: Different parameters p > 3 lead to different final configurations for the gradient flow of 
the rescaled integral Menger curvature but the knot type is preserved. (Images by courtesy of T. Her- 
mes.) 


mann [60] succeeded in deriving rigorously a second variation formula for integral 
Menger curvature .% on curves in the appropriate fractional Sobolev spaces. 

The only approach to deal with higher-dimensional critical points for geometric 
curvature energies is the ingenious paper by A. Nabutovsky [80] who combined com- 
plexity theory with real algebraic geometry to prove the existence of infinitely many 
critical unknotted hyperspheres in R” for n = 6 for a higher-dimensional variant of 
ropelength. 


Open questions 2.2.11. Are there critical points for geometric curvature energies in 
every prescribed knot class other than the known global minimisers? In particular, are 
there critical unknots different*® from the round circle? And if so, how many are there? 
Can we relate critical points of different geometric curvature energies with each other, 
e.g., via ['-convergence? What can be said about the stability of such critical points? Is 
it possible to find critical configurations in higher dimensions? 


2.8 Energies that allow such critical unknots would therefore not be suitable to give an alternative 
proof of the Smale conjecture by means of a gradient flow as, e.g., suggested by Freedman et al. in [41, 
p. 41] for the Mobius energy. We do not address here the very challenging topic of gradient flows for 
geometric curvature energies and just refer to the pioneering work of Blatt on the gradient flow for the 
Mobius energy and other O’Hara energies [12, 15, 16] 
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Fig. 2.5: Springy knots: figure-eight knot, mathematician’s loop, and Chinese button knot. Wire 
models manufactured by wHy KNOTS, Aptos, in 1980; photographs by B. Bollwerk, Aachen. 


Further implications on geometric knot theory. The round circle minimises many 
of the known geometric knot energies [41, Corollary 2.2],[1], [119, Lemma 7],[129, Corol- 
lary 5.12], and we expect the same for integral Menger curvature due to strong numer- 
ical evidence based on Hermes’ numerical experiments with his gradient flow algo- 
rithm [52, Section 4.3]. In addition, we mentioned the explicit continuous families 
of ropelength-minimising links constructed by Cantarella et al. [29]. More recently, 
I. Agol, F. C. Marques, and A. Néves applied their ingenious min-max-theory for mini- 
mal currents to resolve not only the famous Willmore conjecture [74] but also a conjec- 
ture by Freedman et al. by proving that the stereographic projection of the standard 
Hopf-link minimises the Mobius energy; see [2]. Apart from these results nothing is 
known analytically about the shape of non-trivially knotted minimising curves. For 
the ropelength-minimising trefoil, the so-called ideal trefoil one has presumably fairly 
accurate numerical solutions [8, 30, 9, 4, 91] and some local analytic information on 
the possible shape of general ideal knots [110, 26, 39, 40, 27] extracted from the com- 
plicated necessary conditions. 

If one combines geometric curvature energies with (higher order) local energies like 
the classic bending energy 4yena(7) := ff ‘i x? ds, one can study minimal configurations 
for such energies under topological constraints on the competing curves or surfaces. 
This leads to the concept of elastic knots that can be obtained as limits of minimisers 
+g of the total energy ‘ 
Aly] 
as 9 —> O. Recall that A[y] denotes the thickness as defined in (2.1.2), so that the par- 
ticular geometric curvature energy chosen in (2.2.7) is the ropelength functional if one 
restricts to curves of length one. Indeed, it can be shown [43, Theorem 2.2] that in every 
given knot class one finds such a limiting curve yo, which has smaller bending energy 
than any knotted competitor. However, as one would expect from the simple toy mod- 
els of knotted wires designed by J. C. Langer, see Figure 2.5, +o has self-intersections 
unless the given knot class is the unknot in which case pq is the round circle [43, Propo- 
sition 3.1]. 


E9(y) = Ghend(y) + 9 (2.2.7) 
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Is there anything one can say about the actual shape of the elastic knot +o for non- 
trivial knot classes? This is indeed the case for any (2, b)-torus knot as shown in [43, 
Corollary 6.5]: 


Theorem 2.2.12 (Elastic (2, b)-torus knots). For any odd integer |b| = 3 the unique 
elastic (2, b)-torus knot is the doubly covered circle. In particular, the elastic trefoil is 
the doubly covered circle. 


This result confirms mechanical and numerical experiments (see Figure 2.6), as well 
as the heuristics and Metropolis Monte Carlo simulations of R. Gallotti and O. Pierre- 
Louis [42, 90], and the numerical gradient-descent results by S. Avvakumov and 
A. Sossinsky [6]. However, adding twist changes the geometry of the springy wire dras- 
tically; see bottom right of Figure 2.6. And there is no theory yet, describing these twist 
effects for knotted elastic wires. 


9= 0.001 


Fig. 2.6: Top: Numerically computed minimisers of the total energy &% in the class of trefoils ap- 
proaching the doubly covered circle as 9 tends to zero. Bottom left: Mechanical experiments: The 
springy trefoil knot is close to the doubly covered circle. Bottom right: Adding twist leads to a stable 
flat trefoil configuration close to a planar figure-eight. (Wire models by courtesy of J. H. Maddocks, 
Lausanne.) 
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Open questions 2.2.13. Does the round circle minimise integral Menger curvature 
(2.1.5)? Can one prove more about the actual shape of ideal knots? What is the shape of 
links with more than two components minimising the Mébius energy? Can one identify 
the shapes of global minimisers of other geometric curvature energies for curves? What 
can be said about elastic knots for non-trivial knot classes different from (2, b)-torus 
knots? What is the shape of twisted elastic knots? 


A higher-dimensional branch of geometric knot theory is naturally much less devel- 
oped yet. The shape of possible minimising configurations for higher-dimensional ge- 
ometric curvature energies is wide open and seems currently out of reach. There is one 
exception, however: We proved in [64, Theorem 1.5] with the isoperimetric inequal- 
ity and a simple measure-theoretic argument the following uniqueness result for the 
global tangent-point-energy (2.1.16): 


Theorem 2.2.14 (Spheres are unique minimisers). The round sphere uniquely min- 
imises the global tangent-point energy (2.1.16) among all compact embedded C'- 
hypersurfaces in R". 


Recently, we proved in [65] that many higher-dimensional geometric curvature en- 
ergies including integral Menger curvature (2.1.8), (2.1.11) or tangent-point energies 
(2.1.10), or their more singular variants (2.1.15), (2.1.16), are valuable knot energies. 
All these energies are self-repulsive (on the scale above scale-invariance), lower- 
semicontinuous on sublevel sets with respect to Hausdorff-convergence, they enjoy 
nice compactness properties and can thus be minimised in given isotopy classes; see 
[65, Theorem 2, Corollary 1]. They also bound the number of isotopy types with explicit 
constants only depending on the energy level and the integrability exponent, on a di- 
ameter bound, and on the dimensions [65, Theorem 1 & Remark 1.1]. In particular, one 
has the following boundedness result. 


Theorem 2.2.15 (Isotopy finiteness). Let E,d > 0 be given. Then there are at most 
K = K(E, d, k, n, p) different ambient isotopy types among all k-dimensional Lipschitz 
submanifolds X Cc R" with integral Menger curvature @ (2) < E and diam < d. This 
constant may be estimated as 


log log K < c(k, n, P)(| log d| + log(E!/” +1)+ 1). (2.2.8) 


This result can be compared to a whole series of finiteness theorems of diffeomorphism 
types under given bounds on classic curvatures, beginning with the work of J. Cheeger 
[31], and extended by many others, see, e.g, Cheeger’s exhaustive survey [32] and the 
references therein. The notable difference is here that we deal with embedded sub- 
manifolds of lower regularity, whose Riemannian metrics are just Holder continuous 
so that the classic notion of curvature does not make sense. The geometric curvature 
energies in the regime above scale-invariance turn out to be valuable substitutes. The 
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only comparable result with this emphasis is the work of 0. Durumeric [38] who, how- 
ever, works in the context of C!:!-submanifolds with positive thickness. 

Higher-dimensional variants of elastic knots have not been discussed explicitly 
yet, but L. Simon’s pioneering work [113] solves the problem of minimising the Will- 
more energy 


i H? (x) d.7(x) (2.2.9) 
Py 


in the class of two-dimensional embedded surfaces with prescribed genus or under 
alternative constraints; see also [10, 100, 111, 69, 58, 99, 79]. But minimising the Will- 
more energy or related functionals such as the Helfrich functional [51, 81] on given 
isotopy classes has to the best of our knowledge not been investigated yet — with the 
exception of recent work of P. Breuning, J. Hirsch, and E. Mader-Baumdicker [23] on 
Willmore minimising Klein bottles. 


Open questions 2.2.16. Is it possible to identify the shape of global minimisers of 
higher-dimensional geometric curvature energies? The explicit constant in Theorem 
2.2.15 is far from being optimal, what is the best constant bounding the number of iso- 
topy types under given energy values? For curves the energies could often be related 
to knot invariants or to quantities like the average crossing number (2.1.17) controlling 
knot invariants. Are there meaningful topological invariants or geometric quantities for 
higher-dimensional knots that could be controlled by means of higher-dimensional geo- 
metric curvature energies? Are there higher-dimensional elastic knots, for instance min- 
imisers of the Willmore functional in arbitrary prescribed isotopy classes? And can one 
say anything about their shapes? 
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Takeyuki Nagasawa 
On Mobius invariant decomposition of the 
Mobius energy 


3.1 O’Hara’s knot energies 


In his early paper [17], O’Hara took inspiration from Fukuhara’s paper [6] to propose 
the use of the knot energy 


a 1 ‘| e 
tant= ff Gosicr AE FEar) asides 


(R/£Z) 


to determine the “canonical configuration” of knots of a given knot type. Here f : 
R/LZ > R? represents a knot, i.e., a closed curve without self-intersections that is 
parametrized by arc-length s and has total length £. The notation J represents the in- 
trinsic distance (i.e., the distance along the curve) between two points f(s;) and f(s2), 
and a and p are positive constants. Because the canonical configuration should be of 
a shape that makes it as easy as possible to determine how it is knotted, O’Hara first 
considered the negative power of the Euclidean distance of two points as the energy 
density. However, this may result in the divergence of energy for all knots. Thus, to 
avoid such a situation, he subtracted the intrinsic distance raised to the same power. 
This means that the energy calculates the proximity of every two points, but excludes 
the singularity at the diagonal set {(s;, s2) € (R/£Z)*|s, = sz}. This procedure ex- 
tracts the convergence part from the divergence integral, which O’Hara calls renormal- 
ization, and is mathematically equivalent to Hadamard’s regularization. 

Let us consider the problem of minimizing O’Hara’s energy in a given knot type 
[K], i.e., the ambient class of given knot K. 


Remark 3.1.1. When ap # 2, the energy does not exhibit scaling invariance, and 
therefore we consider the problem under a length constraint. 


Because the energy € is non-negative, there exists a minimizing sequence in the set 
of “all knots”. The first problem is the existence of minimizers in the class [K]. When 
ap = 2, the energy induces self-repulsiveness. That is, the knots cannot be continu- 
ously deformed into a different class while preserving the finiteness of the energy. 
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— ? 
SO ee 


Fig. 3.1: pull-tight (The image was firstly appeared in [16].) 


Theorem 3.1.1 ((19]). Eq, p) is finite for smooth embedded knots if and only if a and p 
satisfy 


+; and 2<a<4, 


as2, orp< 


Furthermore, if ap = 2, then E(a,p) induces self-repulsiveness. 


This means that there exists an energy wall between two classes of knots when ap = 2. 
In other words, when ap < 2, there is no energy wall, and therefore the minimizing 
sequence can freely move into other knot classes. This suggests that minimizers are 
not guaranteed to exist for any given knot type except for the trivial type. 

Furthermore there is an energy hole when ap = 2. This is the case when the energy 
is scaling-invariant. Scaling-invariance may allow the pull-tight phenomena to occur 
along the sequence. If such a phenomena occurs, then the limit knot, if it exists, is not 
of the same knot type as the sequence. The energy behavior along the pull-tight for 
E(q,p) has been studied by O’Hara [18]. 


Theorem 3.1.2 ((18]). Leta knot K, be aconnected sum of K anda small tangle T;. The 
difference of the energy D(e) = E(a,p) (Ke) - Ew, p)(K) behaves as follows in a pull-tight 
process T; — {a point}: 

— D(e) blows up when ap > 2. 

— D(e) converges to a positive constant when ap = 2. 

— D(e) vanishes when ap < 2. 


Consequently, when ap = 2, a knot can be moved into another class while preserv- 
ing the finiteness of the energy, through a pull-tight process that we call the energy 
hole. This effect is related to the concentration of energy density at scaling-invariant 
energies. 

Hence, when ap > 2, the energy wall is present but without energy holes, and we 
can expect that the direct method of calculus of variations can be applied. Indeed, the 
following result was demonstrated by O’Hara. 


Theorem 3.1.3 ((19]). Let n = 3. Then, there exists a minimizer (under rescaling) for 
any knot types if ap > 2. 


We call the cases ap > 2, = 2, and < 2 subcritical, critical, and supercritical, respec- 
tively. 
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3.2 Freedman-He-Wang’s procedure and 
the Kusner-Sullivan conjecture 


A pull-tight entails the disappearance of a tangle. The above results implies that a 
pull-tight can occur in critical and supercritical cases. This shows that the argument 
of minimizing sequence fails in such cases. Nevertheless, the argument applies for 
the case (a, p) = (2,1) in prime knot classes. This remarkable result was proven by 
Freedman-He-Wang [5]. 


Definiton 3.2.1. Let n = 3. A knot is a composite knot if it is a connected sum of two 
non-trivial knots. A prime knot is a knot that is neither composite nor trivial. 


The authors showed that €(,,) is not only invariant under scaling, but also under 
Mobius transformations. Since then, this has been called the Mobius energy. 


Theorem 3.2.1 ((5]). There exists a minimizer of E(2,1) on each prime or trivial knot 
type. 


The key point is avoiding the occurrence of a pull-tight along the minimizing se- 
quence. If the knot is prime, then we can enlarge the tangle by inversion with respect 
to a sphere near the shrinking tangle, without changing the energy level. The authors 
passed to the limit of the minimizing sequence together with, if necessary, the en- 
larged tangle, and showed the limit knot is a minimizer in the given knot class. If the 
knot is composite, then two tangles may shrink simultaneously. Hence, such a method 
does not apply. 
For a knot class [K], we denote 


é K]) = inf € = 
onl) = inf Ean 


Fig. 3.2: A composite knot A prime knot 
(The images were firstly appeared in [16].) 
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transformation 


Fig. 3.3: Freedman-He-Wang’s procedure (The image was firstly appeared in [16].) 


Note that Ean (KD exists, because the energy density is non-negative. Kusner and 
Sullivan [14] investigated the energy €(, ,) for various knots numerically, and proposed 
the following conjecture. 
Conjecture 3.2.1 (The Kusner-Sullivan conjecture [14]). 
1. There do not exist minimizers of composite knot type. 
2. Assume that f ¢€ [K] is composite, and it is a connected sum f, tf>, for f; € [Kj]. 
Then, it holds that 


Ee, (IK) 5 Ee, ([Ki) + Ee, ([K2)). 


As far as the author knows, this remains an open conjecture. 


3.3 Basic properties of the Mébius energy 


In what follows, we consider the Mobius energy in R", which is denoted simply by €: 


1 1 
(f) // Gosicn ale Faye) a4. 
(R/£Z)2 


Mobius invariance still holds for the n-dimensional case. As stated in the previous sec- 
tion, the minimizing problem for the Mébius energy was partially solved and remains 
partially unsolved. Hence, it seems important to study further properties of €. To begin 
with it, we summarize some basic properties that are given without proofs. 

Because our energy was introduced to determine the “canonical” configuration 
of knots, we expect that the finiteness of energy suggests some regularity of curves. 
Indeed, we have the bi-Lipschitz estimate for a curve with finite energy, which means 
that the curve cannot bend rapidly. 

Because we use the arc-length parameter, the fact that 


IF (s1) - Fls2)lin» S AF(S1), Fls2)) 


is trivial. That is, f is Lipschitz continuous, with the Lipschitz constant 1. The finite- 
ness of € implies the converse estimate. 
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Theorem 3.3.1 ([19, 2]). If E(f) < M, then there exists A = A(M) > O such that 
If (si) - F(S2)\Ipn 2 APF(S1), F(s2)). 


We call these estimates bi-Lipschitz continuity. In the following, we assume that f is 
bi-Lipschitz. 

The finiteness of energy not only implies bi-Lipschitz continuity but also the inte- 
grability of (fractional) derivatives. 


Definiton 3.3.1 (Sobolev-Slobodeckij space). For a non-negative integer j, and a € 
(0, 1), Wi**P(R/ LZ, R") is defined as 


WiP(R/LZ, R") = (f € W?(R/LZ, R")| [Flap < 9}, 


i 
p 


& 
27 . . 
IF (sa + 2) - FSP 
fF lap = / i migag Ri ds7ds1 
/£Z-£ 


with the norm 
IFllweer = UF llwie + FO lap. 


When p = 2, we denote W/**?(R/L£Z) by H/**(R/LZ). 

The following result was given by Blatt [2]. 

Theorem 3.3.2 ((2]). The finiteness result €(f) < coimplies the bi-Lipschitz continuity 
of f andf ¢ H?/2(R/LZ) n W)(R/LZ). The converse is also true, ie., if f is bi- 
Lipschitz and belongs to H?/2(R/£LZ) n W'(R/LZ), then E(f) is finite. 


This indicates the proper domain of €. 


Remark 3.3.1. O’Hara and Blatt demonstrated that the above results not only hold for 
the Mobius energy, but also for the general O’Hara’s energy. For details, we refer the 
reader to the papers of those authors. 


The criticality of energy can provide further information regarding regularity. Several 
results have been established. Freedman, He, and Wang [5] and He [7] demonstrated 
the regularity of local minimizers. 


Theorem 3.3.3 ((5, 7]). Local minimizers of € with respect to L°(R/£LZ) topology are 
smooth. 


Reiter [21] proved the regularity for not only local minimizers, but also critical points. 


Theorem 3.3.4 ((21]). Any critical point of € in W7’*(IR/£Z) is smooth. 
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Recently, Blatt, Reiter, and Schikorra [4] improved on the assumptions of the previous 
result. 


Theorem 3.3.5 ((4]). Any critical point of € with finite energy is smooth. 


3.4 The Mobius invariant decomposition 


In this section, we present a decomposition of the Mébius energy, where each part 
retains the Mébius invariance. This is based on a series of papers [10]—[13] written 
with Ishizeki. See also the dissertation of that author [9]. According to Theorems 3.3.1 
and 3.3.2, we may assume the existence of the unit tangent vector t(s) = f(s) almost 
everywhere, which satisfies some integrability condition. Then, the energy may be 
decomposed into three parts: 


Ef) = E1f) + E2(f) + 4. 


See Theorem 3.4.1 below for details. This was recently demonstrated by our research 
group [10]. The first decomposed energy €, is an analogue of the Gagliardo semi-norm 
of t in the fractional Sobolev space H 1/2(R/ LZ). Indeed, if we replace the Euclidean 
distance in the denominator of the energy density of €; (see.% in Theorem 3.4.1) with 
the intrinsic distance, we find that 


jas 2 ; 
Ei(f) = >| / eat ds,dsz = UF Fi. 


This implies that the domain of € is H?/?(R/£Z) Nn W!*(R/LZ), as was shown by 
Blatt. The energy density of the second energy €> has a determinant structure, which 
implies a cancellation of the integrand. O’Hara helpfully informed the current author 
that the second energy is a constant multiple of the O’Hara-Solanes energy ([20]). Asa 
by-product, we can provide an alternative proof of the known fact that the right circle, 
i.e., the circle with constant curvature, is the only global minimizer of the Mébius en- 
ergy. In Section 3.4.1, we give the precise statement of the decomposition and a sketch 
of the proof. 

Using this decomposition, we can derive explicit expressions for variational for- 
mulae and reasonable estimates in several function spaces, including absolute inte- 
grability ([11, 13]). Indeed, because the last part, “4”, is an absolute constant, we can 
ignore this when considering variational problems. This fact shortens the derivation 
of variational formulae, and enables us to find their “good” estimates for them in sev- 
eral functional spaces [11]. Furthermore, we can determine the L?-gradient of each 
decomposed energy that contains the fractional Laplacian (—A;)?/? as the principal 
term [13]. See Section 3.4.2 for further details. 

Surprisingly, each of the components of the decomposition is Mébius invariant. 
In Section 3.4.3, we focus on this fact, which has been studied in [10, 12]. As a conse- 
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quence, we can demonstrate that right circles are the only global minimizers of €; with 
the minimum value 277. This seems to be related to the fact that the first eigenvalue 


3 
of (-As)2/? is (=) 
Ly 


3.4.1 The decomposition 
The precise statement of the decomposition is as follows. 


Theorem 3.4.1 ({10]). Assume that f satisfies the bi-Lipshitz estimate, and that it be- 
longs to H?/2(R/LZ) mn W»(R/ LZ). Let t(s) = f (s) be the unit tangent vector. Then, 
it holds that 

Ef) = €1(9) + £209) + 4, 


where 
e(f) = if | Mf) ds dsp, 
(R/L£Z)2 
— |[t(s1) - T(s2)||2n 
chai TT CHES (C9) 
7 2 T(s1) + T(S2) (f(s) -F(S2)) - t(s1) 
noe, reer *( (Fs1)-Fls2))+r(s2)_ Iiflss) ~ f(s.) I2» ) 


Proof. In order to deform the energy density, we first consider following integration 
in the sense of Cauchy’s principal value, 


lim // +++ dsids>, 
€—+0 
|s1-So|Ze 


and derive absolute integrability later. We differentiate A(f(s1), f(s2))* with respect 
to S>. In the sense of distributions, we have that 


26 |x| = pv.— 
dx g — p ’ 
where 
(pv.—,@) = lim if Plo) d 
€—>+0 
|x|Ze 


for p € Cp (R) (see [15]). Using the periodicity of A(f(s1), f(s2)), the distributional 
derivative is given by 
1 L 
3 “Ne (so + 3 <8, <s2+L (mod £)), 
0S> log AF (s1), F(s2)) = St 1 52 L 
$2 -py. (sp S81 < Sot 5 (mod £)). 


$1 —S2 
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Regarding this as a distribution of s;, we have that it is differentiable for s; 4 s+ £/2 
in the weak sense, and we obtain that 


log A(F(s1), F(s2)) = 


a2 F 1 
D5, 0S PM OF (51), Fls2))2 


As a function of s>, aS log A(f(s1), ban has a jump discontinuity at sz = s1 + re 


with a -4/Z£. As functions of s>, reer a log ||f(s1)—f(S2)||pn is bounded at sy = 5, +4 55 
and 2 3s; 108 Ilf (s1)-f(S2)||gn is continuous at the same points. Therefore, we obtain that 


/ ae in ds 
AF(sv, f(s” 


- lim 7: — (ioe iris - F(s2)|lax — log 


€S|s1-s2|S 5-6 


3? ) 
- [Pex loslitss)—Flc2)|v dsa+ | 52 10g 


€S|si-s2|SF> 


\lf(s1) feel ) as 
AF (s1), F(s2)) 


\If(s1) — f(S2)||ign BAF PERES i 
z 


AF (si); Fls2)) 7 


S2=S1-E 


We integrate this with respect to s,. Using periodicity, we note that 


\If(s1) - ee as 
/ so! o8 AF (si), f(s2)) So= Pe 


= , eee f(s2))- T(S}) 1 J ass 


IIf(si)-Flsa)llgn = $1 - 82 


(f(s1) -f(si +€))- tsi) — (f(si + €) - f(s1)): T(si+é) | +2 has 
\If(s1) — F(si + €)||2, If (si + €) -f (si) |l2n : 


2F (si + €) - f(si)) - tsi) , +2 has, 
If(si + €)- F(svllin 


a 


a — log |lF(s1) —F(S1 + 2)Ilpe + 


I [+ 1 1 ) 
IF(si+e)-F(si\ln 7 


) Sité 
"FS + 6) - Fs viien fo ry-ren ds}. 


S1 
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We have that 


(Rasa-ram @) 
F)-Feoe 2 


S{t+€ Si+E 


= (1 - 1(s3) - 1(s4))ds3dsy 


~ e2lif(si +6) —F (salen : 


Si +E Si +E 


1 
ellf(si + €) - (Si) lan : 


\It(s3) — T(s4)|lands3dsy 


and by applying the estimate ||f(s; + €) — f(s1)||px = Ate, and changing the order of 
integration, we obtain that 


Sit Si +E 


1 2 
- nds3dsid 
A elf: +8) —~Fewiiee i i Hsp ee leeladeetss 


Syt€ S 
2 4 3 
ee 

R/LZ S4-€ $3-€ 


2 * Ir(s3) - T(s4)]|2n 
SA i i “FG(S3), F(S4))2 345% 


< \|z(s3) — T(s4)||Rnds1ds3ds, 


R/LZ S4-€ 


Because 


AP \|t(s3) — T(s4) lan 
F line i FF (s3), F(sa))2 ds3ds,y 


(R/£Z)2 
is finite, the absolute continuity of the integration yields that 


S{+E S{tE 
t 2 
ay Fi T(S3) — T(S4)||ignds3ds4ds, =0 
im f elf. +8)-F Gin J IIt(s3 a)||auds3dsqdsy 
1. 1 


Similarly, we have that 


Site 


2 
need 
[ If(s: + 2) - fsi)Ie | (1- r(s3)- 1(s1))dssds1 


2 *|It(s3) - T(S1) || Zn 
<A | ii “OE Ss) FS) OS ds3ds; > 0 as e>+0. 


R/LZ 81 


Hence, we obtain that 


01, Iif(ss) - f(s2)|len ]?** 
/ ES log AGUS), Fls>)) [a +0 as €—+40, 


R/LZ 
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which leads to the expression 


1 0? 
Ef) = me HT Cire =F, 35195 log ||f(s1) -fls2)|iv ) Assds2 +4, 


By manipulating the log term in the above, we obtain that 
1+1(s1) + T(s2) 
eee a {Red re, 
HGS ah SED 
ds,dsp. 
[F (si) - fsa)Ii4 ur 


The density of this integral can be expressed as 


1+7(s1)-t(s2) 2 {(F(s1) - f(s2)) - t(s1)} {(F(s1) - F(s2)) - t(s2)} 


(3.4.1) 


If (si) —Flsa)|l2n \If(s1) - F(s2)\I bn 
_ 1-1(s1)+ T(s2) , 2t(si)- T(S2)|IF(S1) — F(S2)||fin 
IFS) -FGa)|2n If (S1) - F(sa)|Ihn 


{(f(s1) - F(s2)) - t(s1)} {F(s1) - F(s2)) « t(s2)} 
IF (1) - F(s2)|l gn 
_ It(s1)- ts 2)llan 2 
2If sv -fsvilz. Fs - FSI. 
oe ( t(s1)-t(52) (FS) - fls2)) + r(61) 
(F(s1) - f(s2))-t(s2) [If (81) - F(s2)|l2n 


= M(f)+ af). 
It remains to remove p.v. from in front of the double integral in (3.4.1). Because 
M(f) is non-negative, the energy €, is defined as the absolute integral for curves in 


H?2 (R/£LZ) with bi-Lipschitz continuity. Therefore, the absolute integrability of .7(f) 
follows from the fact that 


M(f) + a2(f) 2 0, 


which we will show below. 
We use the notation A to denote the difference between values at s = s; and sp: 


As=8,-S2, Af =f(si1)-f(s2), ete. 


For a function v on R/ LZ, we define Rv = Rv(s1, s2) by 


|As|(Av) 
Af les 


and observe that Rf is clearly a unit vector field. By applying the Lagrange formula, 
we have that 


Rv= 


Mf) = T(s1) \ Rf), (t(S2) ARF)), 


imi ( 


46 — Takeyuki Nagasawa 


where / is the wedge product of vectors, and (-, -) is the inner product on A R". It is 
easy to see that for and any two vectors x, y € R" and any unit vector e € R", 


(xe), (yA e)) = (Pex): (Pe y), 


where 
Pex =(x-e)e, Ptx=x- Pex. 
Therefore, 
ee 1 ai 
Mf) = Tag a Pats 1)) + (Pppt(s2)). 
AF lien 
On the other hand, the first density is given by 
1 ATI in 
Mf) = = . 
2 Af lin 


Consequently, we obtain that 
2||AF len) + aa(f) 


= ||AT||fn + 4(Ppyt(s1)) « (Ppt (s2)) 
= ||PppAT||fn + ||PppAT||n + 4(Pppt(s1)) - (Ppt (s2)) 


= ||PppAT||fn + ||Ppp(t(S1) + T(S2))|lfn, 


which is non-negative. 


As a consequence, we can provide an alternative proof for the following fact, which 
has already been established. See, for example, [5]. 


Corollary 3.4.1. It holds that €(f) 2 4. Equality holds if and only of f is a right circle. 


Proof. Theorem 3.4.1 and the non-negativity of .4,(f) + “(f) implies €(f) 2 4, and 
equality holds if and only if .4, (f) + 42(f) = 0. This is equivalent to 


PppAt=0, Ppp(t(si) + T(s2)) =O. 
In particular, from the second relation we find a function y such that 


Ff (si) - f(S2) = u(S1, $2)(t(s1) + T(S2)). (3.4.2) 
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Because a minimizer is smooth, so is p. We differentiate the above relation three times 
with respect to s;, to obtain that 


T(s1) = sh Oss, $2)(t(s1) + T(S2)) + u(S1, $2)x(S1), (3.4.3) 
K(s1) = a 2 ($1, $2)(T(S1) + T(S2)) + 2 SE (64, S2)K(S1) + U(S1,$2)K ($1), (3.4.4) 
eis) = He spe) 6) 3 Bt, s2)m( G4.5) 


fe) , 
+ 35E (S1, 52)x (S1) + w(s1, 82) (51). 


Setting s; = sp = s in (3.4.2)-(3.4.5), we have that 


r) 1 0 
(s,s) =0, 3g, 89) = > paz 8) = 0 
and 
x (s) = -4Sh HG, s)t(s). (3.4.6) 


Taking the inner product between each side ‘ef (3.4.6) and x(s), we know that ||«(s) ||; 
is independent of s, and we write this as x. If x = 0, then T (s) = x(s) = 0, and therefore 
t(s) is a constant vector. This is impossible, because f is a closed curve. Consequently, 
x > O. By taking the inner product between each side of (3.4.6) and t(s), we know that 


-42- Kis, s) = K(s)- t(s) = -K(s) + x(s) = -K? 
OS} 


By inserting this into (3.4.6), we obtain that 
K (s) +x*t(s) =0 
for every s € R/ LZ. Because t(s) = f (s), there exists a constant vector ¢ such that 
K(s) +k? (f(s) -¢) =0. (3.4.7) 
By integrating with respect to s on R/ LZ, and dividing by Z, we find that 


= 4 f(s) ds. 


R/LZ 
We can rewrite (3.4.7) as the following second order differential equation: 
(f(s)-c) +x? (f(s)-c) = 
The solution is given by 


sin Ks 


f(s) — ¢ = (f()-c) cosxs + 1(0). 


That is, f is a right circle with center c and radius x'. Because the total length is £, 
1 


the radius is x ~ = a 
2m 
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3.4.2 Variational formulae 


Let .A(f) be a geometric quantity determined by the closed curve f, and let g and w 
be functions from R/£Z to R". We use 6 and 67 to denote 


57 (flgl = 


’ 


€=0 


Alo, wl = an Dept EAP + E02) 


€1=E€2=0 
Calculating the first and second variational formulae of the Mébius energy for- 
mally, we have that 


6é sited eae eiaiiex, 
otal = |/ Gosia ) aoe 


; Able oe .»» |dsids>. 
bEFIID, Wl I Grexstene i ) _- 


These expressions contain many terms that are not integrable. He [7] obtained these 
formulae as the integral of Cauchy’s principal value. Ishizeki [8] investigates the ab- 
solute integrability in the regularity class C?**. 


Theorem 3.4.2 ([8]). Iff, @ and wp € C?**(R/LZ), then the integrands of 5€ and 67 
are absolutely integrable. 


This result can be improved for the proper domain by using the decomposition. 
Indeed, using this, we can derive the explicit expressions of variational formulae 
and reasonable estimates in several function spaces, including absolute integrability 
(11, 13]). 
The first variation Y and the second variation .% of the decomposed energies €; 
are given by 
G(l@] dsidsz = 5(.4,(f) ds; ds2)[g], 


SAPD, W) dsidsz = 6° (.U(F) dsids.)lg, wl. 


Next, we obtain the explicit expressions of variations. To present the corresponding 
statement, we first introduce several notations. For a function v on R/ LZ, the opera- 
tions Q, Q;, S, and S; are defined as 


Qv=Av, Ov =(-1)'12{v; - Rf -7)Rv}, 


|As|Av “ 1% 8G 
’ R S b 
[fie as? Y= 71 + ¥2) 


S(v, w) = Rv-Qw+Qv-Rw, S;(v, w) = Rv- O;w+ O,v- Rw, 


Rv= 


where the operation R is the same one as in the proof of Theorem 3.4.1. 
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Proposition 3.4.1. We have that 
Of Op  2.M(f)Af - Ap 


AP] = 


AF ll AFI, 
Oif-Qp+Qf-ip 2a(faf-Ap 
G. 
ey) 2\|OF len |4filz. 
and 
; Qp-Qwp Sf, p)Sf, p) 
KH ; = 
Ole. We Fz, «AFR 
2a (Plpl4f-Ap 2APlWlAf-Ap 2Mm(f/Ap-Ap 
AF lin AF llizn AF lin . 
Qi9-Qp+Qp-Qip . Sif, p)So(f, W) + Saf, MSi(f, W) 
KH " 
ee 2\|AF zn 2\/4F iz. 
2%, (f\lpl4f-Ap 2 (fA[IwlAf- Ap 2am(f\Ap-Ap 
AF lfm AF lien AF lin 
This is proven as follows. Set 
MEF) 
Mf) = 
Ve AF \lzn 


Then, it holds that 


MO = FF OF, Mf =-F OF - Of. 


The above relation for .1 is easy to prove. We have already demonstrated the one for 
A, in the proof of Corollary 3.4.1. Therefore, to obtain the expressions of Y and #%, 
we require those of variations of .%, ||Af|||n, and ds; ds. We begin with the variations 
of basic quantities. 


Lemma 3.4.1. The following first variational formulae hold: 
1. St[p] = p -(t-@)rt. 
2. S|AT||2nl@] = 2AT- Ap — ||AT||Zn(T1 + Oy + T2 + @>). 
1 2Af -Ap 
3. 6 yt ]= ; 
IAF) AF 
4, 6(ds;)lp] = Tj Q; dsj. 


Because the proof is not difficult, we omit it. As a consequence of this lemma, we ob- 
tain that 


6.% (flp] = Of -Qp - (11°, + 72° P)AVH), 


6. gl = - 5 (Of -Q2.@ + Qof - O1@) - (11: + T2* Ms)-N(f). 
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From Lemma 3.4.1, we have that 
G(Plpl dsidsz = 6.4 (f\lp] dsidsz + 4; (f) 6(ds; ds2)[@]) 
= {6.algl + U(f\(t1 > p+ T° @)\dsidso, 


i.e., 
G(Plp] = 6.a(Alpl+ GPl(t1- Pp, +T2° Py). 


Now, we are in the position to calculate Y. Using Lemma 3.4.1 again, we have that 


sprig) = SAMlol WO5( Ger lo) 


lafil2, [AF iz. 
_— bMPlg] _ 2.d(fAF Ap 
[fiz l4flz 


By inserting the expressions for 6.%j, we obtain Proposition 3.4.1. 
Similarly, we can obtain the expressions for .#%. See [11] for details. 


We can derive estimates of variational formulae in several function spaces. 


Theorem 3.4.3. We set X = H2(R/£Z) nN W)*(R/LZ), and Y = H2(R/LZ) 0 


L*(R/£Z). Assume that there exists a positive constant A such that ||Af||pn 
A" | AF (s1); f(s2))|- 


1. Iff, p, and w € X, then.4(f), (Ap), and A(f)[g, w] belong to L1(( R/LZ)*). 
Furthermore, there exists a positive constant C, depending on || f || y and A, such 


that 
lA lnqeyezyy SC, 


IGA Plilncercz) S Clg lly. 
|APID, Wllnercz) S Cl 'llvllp lly- 


2. If f, p, and w ¢ C1(R/LZ), then .“4(f), G(AIwl, and 4 (Aig, wp] belong 
to L*((R/£Z)*). Furthermore, there exists a positive constant C, depending on 


If licoageycr)» Aand L, such that 
| MAlz~-ceyeze) SC, 


IAPLPlilr-(e@yez) S CV llcor~escr)> 


|lAPLP, Wl ilr-(ejcz) S Cl@ llcorwesceyllP llcoueycr): 


3. Iff, p, and wp € C*(R/LZ), then .4%(f), AAW], 4API[@, w] can be extended 
on the diagonal set {(s1, S2)| 51 = S2 (mod £Z)} such that these functions are 


continuous everywhere. The limits of sum vanish on the diagonal set: 


lim (al + M2(f)) = 0, 


(si,52)+(s,s 
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lim (aHlel + #%(f)lp)) = 0, 


(s1,52)—+(s,s 


lim (4P/lg. Y~l+ AHIg, wp) = 0. 


(s1,52)—>(s,s) 


Furthermore, there exists a positive constant C, depending on ||f || C\(R/£R)» Aand 
£, such that 
| GP compen) SC; 


IAM ll coe@pczy S CP Icuescr)> 
IAAL, Wl ilooepez) S Cl llowycell¥ lower): 


The estimates in Theorem 3.4.3 follow from next lemma. 


Lemma 3.4.2. 1. For v € X, the following estimate holds: 


Qv 
AF (si), F(s2)) 


S |lv lly. 
L2(R/£Z)) 


2. Forv € Cl!(R/LZ), the following estimate holds: 


ott HV! 
DEF (s1), f(s2)) 


<|lv l|co.1.@@/£z): 
L=((R/£Z)*) 


3, Assume that v € C*(R/LZ). If we set Qv|,_, _,, = v , then Qv is continuous ev- 


erywhere, and 


=S2 


Qv 
IF (si), F(s2)) 


4, Assume that f € X and that ||Af||px => A"|A(F(s1), F(s2))|. Then, there exists a 


positive constant C, depending on |/f ||y and A, such that 


S |v lc1(R/ez)- 
C((R/L£Z)2) 


Oiv r 
DGG), Fe) = Cllv ly 


L?((R/£Z)) 


holds for all v € X. 
5. Assume that f € C!!(R/£Z) and that ||Af||gx => A| A(F(s1), F(s2))|. Then, there 
exists a positive constant C, depending on || f || co1(R/£z)> A, and £, such that 
Qiv 
AF (s1), Fls2)) 


holds for all v € C!(R/L£Z). 


SC|lv ll co.1.@R/£2) 


L=((R/£Z)) 
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6. Assume that f ¢€ C?(R/£Z) and that f has no self-intersections. For v ¢ 
C?(R/£Z), Q;v is continuous everywhere by setting Q;v =v’. If we further 


S=S1=S2 
assume that ||Af||pn = A| A(F(s1), f(s2))|, then, there exists a positive constant 


C, depending on IF llceyez)> A, and £, such that 


Oi 
AF (s1), F(s2)) 
holds for all v € C?(R/£Z). 


SCllv llceycz) 
C°((R/LZ)?) 


Proof. Without loss of generality, we may assume that |s; - s2| < 4, and for sim- 
plicity we use |As| instead of A(f(s1), f(s2)). The assertion for Qv almost matches the 
definition of norms. Indeed, we immediately have that 


Qv , ’ 
a =[v]_1 S |Iv lly 
| AS || 2(R/27)2) HAAR G2) 
and 6 
Vv , , 
| As = |IV lltip S|lv llco1@@/ez): 
L»((R/£Z)*) 


If v € C*(R/LZ), then it is easy to see that 


im Qv = v (s), 
(s1,82)—>(s,8) As 
and 
S1 
Qv 1 / 
ae = max v (s)ds|}  S|lv |laeyez): 
| AS ||co(rjez)2) |as|S4|] $1 ~ $2 


S2 Rr 


To show that the assertion for Q;v holds, we decompose oe Qiv = v;-(Rf-T))RV 
into 


v; -(Rf + 7)RV (vi a) +(% RV) +(1- RFT) RV = Vi + Va + Vo. 


We must derive L?, L~, and C° estimates for each V;/As. Because these are rather 
complicated, we refer the reader to [11]. 


We now present the proof of Theorem 3.4.3. Let Q be Q or Q;. Then, we have that 


i Of Of 
< 
4D! 5 FEC), FC) llen | AFG), FED) lan’ 
for i = 1, 2. Similarly, from Proposition 3.4.1 it can be derived that 
@ <-2 of ap (p)\||_—_4¢___ 
ADLPM S * G, Fa)|| anil DFGD, FSD) len? All DD Fe | 
2 Of Qp 
SM FED, FED | gnl| PCD, FG lle? 7A AMP lun: 
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Let R be R or R, and let S be S or S;. Then, the definitions of these operations yield that 


S(v, w) 
AEF(s1), f(s2)) 
p Qw Qv 
S | RV||ian Pf (s1), F(s2)) i an RW le 
Qw av 
(Wh ep.) FED Ign * | BORED FS) |i) 


Therefore, Proposition 3.4.1 implies that 
Qyw 

R AF(si), fe Rr 
“(thio age ren 


x (i , oy 
MP! DF (si), F(S2)) |len 
+ 2A7|Y;(FIIQI|||Pllrip + 2A7|G(A[P1||| Pl lip 
+ 207| A(F)\NIP llrip ll Pllzip- 


G(Nlp, wl) <2? 


Qp 
he sais 


Aen Fe|,.!%"™) 


__ lili) 


Consequently, the estimates in Theorem 3.4.3 can easily be derived using Lemma 3.4.2. 
If f € C*(R/LZ), then Lemma 3.4.2 yields that 


lim (Al) + al) = FI (ide ~ FIFI» = 0 


(s1,S2)—+(s,s 


Similarly, we can show that both of the limits of Y,(f)+%(f) and 4 (f)+%4 (f/) vanish. 


Theorem 3.4.3 shows that the first variation 6€;(f)[-] is a linear form on the space 
X = H2(R/LZ) 0 W(R/LZ). If f € H3(R/LZ), then it seems that the first variation 
can be extended into L*(IR/£Z) as a linear form by a formal integration by parts. 
Indeed, the principal term of 6€;(f) is 


(F (s1) - f'(s2))  (@'(s1) - 952) 4g 
i IF(s1) - F(S2)|I2n eae 


(R/£Z)? 


By considering bi-Lipschitz continuity, we replace the denominator with 


AF (s1), f(sz))*, so that 


(F (s1) - Ff (2) - (@'(s1) - @'(52)) 
AF (S1). Flsa)) asides 


(R/£Z) 


=2n / (-A,)if -(-As)*@' ds. 
(R/LZ) 
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Here, As = 02 is the Laplace operator with respect to s, not As = sj; — sp. Integrating 
by parts formally, we obtain that 


on / (-As)?f -@ ds. 
(R/£Z) 
This seems to be meaningful for f ¢ H?(R/£Z) and p © L?(R/LZ). Indeed, we can 


justify this for not only the principal term, but also all terms, including 6€>. 
Here, we define a new operation Tr in order to describe the L?-gradient of 6€; as 


a aks pga 
nes (ins) as 7 


Theorem 3.4.4. Let f € H? be bi-Lipschitz. Then, for p € L? it holds that 


6€iAlpl = LF+Ni), P)p, 


where 
2nk |? 8 S 
Lif = 2n(-As)?f - 437) | sil|k)(F, Pr)2Px + FAs -P, 
keZ 
4 3. 8wal2ak|’, es BD ns. 
Lof = - 31(-As)*f + 5 a silk) F, Pre GK+ 5 zh + sf -P 


sinA 


ae dis ae zen (2208 


si(t) = - 


) Fis) =fls+ 4), 


“2 
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N,(f)(s:) = -2 / { gap tTif- ar-.ae, bas; 


R/LZ 
-4 f GO rip + Bf mitt) Flsl bso 


As 
{ (Tif Ta) TBF + (T3F-t2)ToF bds2 


1 
N ae 
f(s) = -4 } mene 
R/LZ 


ai / Gays (TF TOTOF + (TRF eaITHF 


R/LZ 
+2{ (13F 42) +1} (TF rT | as. 


-4 f app [tient te 


R/LZ 
+ 2{ (TRF -t2) + AN (TEF ta) + TRF 72 


As)” 
es) Im: | 71d. 


6 
M- 1 1 
-4 / | aT + Fe {a + Flaite be dso 
R/LZ 


Furthermore, for a € (0, +) it holds that 
INiMlzz S Calf llqs-«). 


Our strategy for proving the above theorem is as follows. Because C*(R/£Z) is dense 
in both H?(R/£Z) and L?(R/ LZ), we can assume that f and @ are sufficiently smooth. 


According to Theorem 3.4.3, the first variation 6€,(-)[-] can be expressed as 


5e(Nlgl = i ) Gl] dsids> 


(R/ LZ)? 


2 
= Gi(f, p)(s1, 82) dsids>, 


Fipjezp 
where 
_ Qaf : Q29+ Qirf -Qiu@ 
Giff, ~) 2|AF 2 
. 2M (fAf + Ap 
Golf, ?) Wale > 


Qi1=Q12=Q, Qzjv= 2{v; — (Rf - 7;)Rv}. 
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We decompose these operations Q;; as Qijv = QV + Qi, where 
As 


O,\v = fe _ (Rf: r)Rv} = 241 -(RF a) eel LY, 


Q=Q;=02, Q;=0, sv =2(v\- 5%), 


Then, we have that 


3 
Gil, p) = >> Ginklf, 9), 


k=1 
Gailf, @) = Oinf - 22 * Caf Qinp 

1 # i 
Gii2F, @) = 5 CA Of On @ + Qf - Qin), 

; _ Onf - Q29 + Qnf - O29 + Qinf -Qo@ 
nee) 2 AF zn 
, Unf Qing + Qinf - 0n@ + OQiaf - Qip 
2\AF llzn 


We have that G;1; is linear with respect to f, but, Gj12, Gj13, and G;2 are not. Next, we 
would like to write 


Giilf, @) dsids> = (Lif, @) 2, 
(R/LZ)2 


| (Gir(f, 9) + Gusf. p) + Gif, p)) dsidsz =(Ni(), @) 12, 
(R/LZ)2 


where L; and N; are linear and nonlinear operations from H? to L?, which we will then 
estimate. 
For the linear parts L; we use the Fourier expansion 


f=S ona, P= >> ody, 


keZ keZ 
where {,;} is a complete orthogonal basis of L*(R/£Z). Using the fact that 
2nikh 


ey(s)= Fos) and gy(s-+h) = exp( 2") ox(s), 


we have that 


F (si) Fe, +) = So EL a — exp( 7) botovda, 


keZ 


p (si)- (sith) = S~ mk exp (=) \ou(sr)b 


keZ 
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Consequently, orthogonality implies that 


| Giiilf, p) dsids2 


(R/LZ)2 
t 
Fs) -Fls1 + W)- (91S) 91+) ans, 
R/LZ-£ 
L 
‘Da 2akh 
Ink 2 1-cos| “-— 
= 2° (2) / oes ds1 (ay, By)c 
keZ “6 
2 
27k | 1 A 
4 — cos 
= 25— Ee / 12 da (ax, bic 
keZ n\k| 
Similarly, we have that 
Ga1ilf, p) dsids2 
(R/LZ)2 
P| 7p M2 cosA- 2AsinA A 
=32 21 A4{A* cosA - 2AsinA + 2(1 - cosA)} dA| (ay, By) 
L M4 
keZ -n\k| 
Thus, we arrive at 
ses n\k| 
4 
Lf=|7E) | f a@aa) 6. 9: r@)e0e 
keZ —71| k| 
where 
z(A) = 2(1 aed: n(A) = 4{A? cosA - ZA SAS 2(1 = cos A)} 
It is not difficult to see that 
m\k| 
/ 2\(A) dA = aj(m + 2si(\kn|)) + 2b,|km|z;(|kn)), 
-n\k| 


where 


(en 
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From the definition of z;, it follows that 


2{1-cos(\km|)} _ 2{1 - (-1)*} 


|kr|Z1(|kzt|) = 


|kr| |ka} 

lket\z>(\ker|) = Sele cost) + 21 shane m|) — 2{1 - cos(|kz|)} 
_ Al-(-1)* | kal? - 2{4 - (-1)*}] 
|kn|? ; 


By combining these with the expression 


SO, Prde@yen PK =f, 

kEZ 

si 
L 


keZ 


Ff, Px)rRj cz) Pk = —Asf, 


¥ 


SCE, Pk) L2(R/ LZ) Pk =F, 


keZ 


ank|* 
L 


(-1)*(f, Px12(R/ cz) Pk = —Asf 


» 


keZ 


we obtain the same expressions for the linear parts L; as in Theorem 3.4.4. 

Because the derivation of the nonlinear parts N; is considerably more compli- 
cated, we only provide a sketch. For details, see [13]. 

We employ G(f, p) as one of Gii,(f, @) (k = 2, 3) or Gi2(f, @), which then take 
the form 


GF, p) =G,(f) Ap +G,(f)-Ap+G.(f)- p (s1) + G(f)- p (s2). 


The following can be obtained through a simple calculation. 


Lemma 3.4.3. The following relations hold: 


1. 
a £(s1, 52)» @ (s1) dsids> = i (G(s, s+ €)- ¢(s,s-e)) -@(s)ds 
|s1-s2|Ze Kine 
2 // so S(S1.82) (ss) dds, 
|s1-S2|Ze 
2. 


/ £(S1, $2) « @ ($2) dsids2 = / (¢(s+e,s)-¢(s—e€,s)) - p(s) ds 


|s1-S2|2e R/LZ 


- // 5a $81.82)“ ls2) dsrds, 


|s1-S2|Ze 
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// O(S1, 52): Ag ds,ds> = // (¢(s1, So) - O(S2, S1)) - p(s1) dsyds> 


|s1-S2|2e |s1-So|De 


i, {(s1, $2) - A’ dsids> 


|s1-S2|2e 


= i, ({(s,st+e)-C(s+e,s)-C(s,s—e)+ C(s—e,s))- p(s) ds 


R/LZ 


: // 5 ls1,82)- $62, $1)) > p(si) ds,ds>. 
1 


|s1-So|Ze 
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From Lemma 3.4.3, we obtain that 


Gf, p)dsids2 = lim, He | G(f, p) dsidsz 


(R/£Z)2 |s1-S2|Ze 


€>+0 


in| // G,(f)-Ap'dsdss+ | G,(f) -Ap dsids, 


|s1-S2|Ze |s1-S2|Ze 


+ i G.(f)-p (s1) ds,ds> + // cob dude 


|si-s2|2e |si-S2|Ze 


nf / (G,(f)(s,s+e)-G,(f\(s +e, s) 
R 


jeu 
-G,(f)(s,s -€)- G, (f(s — €, s)) - p(s) ds 
2 / i 5a: (G,(U51. 82) - G,(Nls2, 81)) - PlS1) dsrds, 


|S1-S2|2e 


a // (G,(F)(s1, $2) = G,(f)(s2, 51) - p(si) dsidsz 


|si-S2|Ze 


‘ if (G,(f\ls,s +e) - G.(f(s, 8-8) - gs) ds 


R/LZ 
7 // 52 Gls: 52) tsi) dsids2 
1 


|S1-S2|Ze 


: i, (G,(f)(s + 2,5) -G,(f\(s - €,5)) « pls) ds 


R/LZ 


= // $60.20 eindnn| 
2 


|s1-S2|Ze 


= (1). 
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Here, we will prove that if f ¢ H?, then it holds that 


(t) = [Phos (G,(P\(s1, 52) - G,(f)(s2.51)) 


(R/£Z) 


+G eas es 
32 G.(Ns1, 52) - 32.6 2 G,(fils2.si)} apts Nderds 
= (4). 


Furthermore, we shall verify that 


nepid= | {-(6,ls.s2)- 6,629) 
R/LZ 


+G, $2) — G,(f)(s2, 5) 
2 G(s, s2)- 26 Hs2.5)} ds; 


is well-defined at #1-a.e. s € R/LZ, so that we can apply Fubini’s theorem, which 
implies that 
= [ NNG)- 9(8) ds = (ND. Peyc2 
R/LZ 
We will also show that N(f) is a lower order term, whose order is less than three, 
and 
INP ilzz S CF llzs-«, A). 


We employ Nj; or Nj2 as counterparts of N when G = Gj1; or Gj2, respectively. 
Here, we will present details for N12 only. For this, we require the following facts, 
which we state without their proof. 


Lemma 3.4.4. Let x € L®. Then, it holds that 
Tif =O(As), TPf-+t(sj)=O(As)*, TPF: a = 0(As)?. 


Suppose that f is bi-Lipschitz and x € L®. For k = 1, it holds that 


= 


TKf =O(As), TK +1(s;)=O(As)? TK - = = O(As)’. 


Lemma 3.4.5. It holds that 


2(-1) 


(as)> TIF 78) 


) 
OS; Af) = 
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It is easy to see that 
Gil, P) =S1nlf) Ag, 


where 


£112 (f)(S1, $2) = Mf Af . 


In the following, we denote g,,5(f)(S1, 52) by $117(S1, S2), for convenience. 


Lemma 3.4.6. Let a € (0, 1). If f ¢ H?-“(R/£Z), it follows that 


Gi12f, P)(S1, $2) dsidsz = (Ni12(f), @) 22, 
(R/£R)2 
where 
ae 


Ni(fie)=2_ [ { ap 


R/LZ 


(rif - x(s,))Ar - Moms) | dso, 


INir2A\lz2 S CCF | q3-«). 


Proof. Using Lemma 3.4.3, we have that 
/ Gii2lf, p) dsids2 
|si-s2|Ze 


= / Sip) Ap ds, ds» 


|si-so|Ze 


= / ($149(S,S + €) — 8119(S + €, 8) -— $449(S,S — €) + 8440(s — €, 5))- ls) ds 
R/LZ 


= // 5a; Gra(81, 52) - S112l62, 81) C61) dds. 


|si-s2|Z€ 
=(*). 


W remark that 
$112(S1, S2)= MPAf = A(f)At = OAs), 


which implies that 
8110(S,8 + €) — $440(S + €, S) — 8440(S, 5 — €) + 8449(S —€, 8) = Ole) (€ 40). 


Note that O(¢) is uniform with regard to s ¢ R/£Z, and in the following, we employ 
the above notation with this meaning. Because it holds that 


£112(S1, $2) — $112(S2, $1) = 2.W(fAT, 
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we can apply Lemmas 3.4.4—3.4.5 to infer that 


3 {8112(S1, $2) — $112(S2, $1) } 
1 
(far. a 


OS) 
-2{- aaa Tif -tes))at + apxts,)} 


= 0(As) **?* + 0(4) 
=QO(1) as AsO 


which implies that Ne {8449(S1, $2) — $449(S2, $1)} € L™ ((R/£Z)*). Therefore, the 


expression is absolutely: integrable, and we can use Fubini’s theorem in order to obtain 
that 


(*) > - Ai 2{- Gis cag (Pit r(s,))ae +. (Pyx(ss)} pls) 4514s; 


(R/£Z)? 


-fief {as (Tif -rs:))ar- a ports) dss - (sr) ds 


R/LZ R/LZ 


= (Ni2(f), P),25 


by letting « — +0 in (*). Because it holds that 


Il llz= = IF lt» S Calf llne-« 


fora € (0, 5) the bound on the integrand of N12 follows from Lemma 3.4.4. Thus, 
we have that 


INir2A)\lz2 S CllFllips-«- 


Although the proof is more complicated, a similar result holds for G212. Because 
G13 = 0, no further work is necessary in this case. We must consider G23 very care- 
fully. This is decomposed as 


G23, P) = Grisa(f) -A@ + Gri3c(f) - p (s1) + Grisp(f)- @ (s2), 
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where 
Gal goa {Oe rts) (Tif + af) cry r(s2))( THF + x) 
(As)|AFI2, 
+ 2(Tif rs (THF ea ZE I, 
af, 
Gri3c(f) = “Ta — (TS f'- t(s2) 55 
af 
G T 
213p(f) = “ane: TTF TS) 55 


Because none of G213z, G213c, and G13p are absolutely integrable, we must to com- 
bine them appropriately. We require a similar treatment for G;2, but the stuation is less 
complex than that of G23. For details, see [13]. 


3.4.3 The Mobius invariance 
The Mobius invariance of each decomposed energy €; is discussed in [10, 12]. 


Theorem 3.4.5 ({10]). Each €; is Mébius invariant in the following sense: 
1. It is invariant under rigid motion and dilation. 


2. Let 
r(f-o) 
IF - Cll 
be the inversion with respect to sphere with center c and radius r. 
(1) Iff ¢ Wt1(R/LZ), then €(f) + E2(f) = €1(—p) + Er2(p). 
(2) Ife ¢Imf and if E(f) < o, then €;(f) = €1(p) and €2(f) = €2(p). 


frp=ctr 


Remark 3.4.1. In the assertion (1) we assume that neither c ¢ Imf nor E(f) < ov. 


Because the invariance under rigid motion and dilation can be easily proved, we focus 
on the invariance under inversion with respect to the sphere. Although the invariance 
of (4 + .@) ds; ds» was shown in [5], we priovide the proof here. First, we note that 


1+7(si)-T(s2)  2(Af - t(s1)) (AF + t(s2)) 
AF |ln IAF llr 
Even if s is an arc-length parameter for f, it is not necessarily one for p. Therefore, we 


use a general parameter 0 instead of s, and the energy density with respect to d0,d0 
is 


Mf) + olf) = 


IF (01) ||ien \|F (02) llien + (1) - (02) 
IF (01) — F(O2)|l2n 


+ 5 (39; louis) - £060)» ) (9- low (0s) F062). 


(Af) + of) IFO Ile IIF(O2) lz = 
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Heref means the differentiation of f with respect to the general parameter, and simi- 
larly for other functions. 


Theorem 3.4.6. Let 
rf-o) 
r= ait 
be an inversion with respect to the sphere with center c and radius r. Then, it holds 
that 


fr p=ct 


(Af) + ZF) FO l|xn|FO2) |e» - (1) + 42) 1541) ie" ||(2) ||» = 0 
(3.4.8) 


for 6, and @> such that 


F(91) FF(O2), FU) 4e G=1, 2). 


Proof. We decompose the difference between the densities for f and for p as follows: 


(Af) + AF) FO lle IF (2) lien — (Za) + (p)) ||P) |e" (82) lle 
=Ji+Jo+Js, 


where 


FO )ilwr|IFC2)Ihe — (1591) Ihe" || (92) len 
If (01) - Fa) llZn p61) — p(2)\len 
f(1) - f(@2) p(41) - p(62) 
F(1)-FOV)\zn p01) — Pa) |\Rn’ 


Ja= 54 ( 53 low iF) -F(02)|2 ) (59 1ou (61) - 02) Re} 


i= 


Jr = 


) ) 
- ( 39; lor lle(@s)- ptO2)I2e ) (sg. los llp(@s) - ptC2)I2e) 
It holds that 


\|p (01) — p(42)||n 
Pr (f(6:)-¢)  P(F(@2)-e) |” 
IF(O1)- clin [IF@2)-ell2n || 
ee: 1 2(f(61) - c) - (F(@2) - ¢) s 1 

WFO) -ellz.  F(1) - eli. IF2)- el2, F(z) - ell2. 


_ FO) - F(O2)|lfin 
IF (01) - €l|nlF(E2) - Clin 
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If we define the projection P-(@) and P2 (6) for a vector v by 


ii MOON HOR» seh Sos 
Pow = (vais) Hele’ POI PO), 


then the derivative of p(@) may be expressed as 


30) =? #0 2 O-(FO-6) (FO - 9] 


\IF(8) - Clin \IF(8) — Clan 


= r? lt , 
IF) — cl, (P c (8) P(6)) (6). 


Therefore, we have that 


[B(9)|lie = (|lPe (OF (lz + \\Pc(@FO)|i2n) 


r4 
IF (8) - Cllpn 
_ AWOL 


~ (F@)= elif,” 
As a straightforward consequence, we have that 
||P (91) lie» || (2) IIe» 
\|P(91) — p(82)|lin 


_ PF Odie 17 FO2)\len_ F(x) - ell fz) - elle 
IF (1) — Clin IF(O2)— Cllan 4 IF(O1) - Fa) lian 


_ WF rhe (O2) he 
IF (1) - F(O2) Ilan” 


which demonstrates that J; = 0. By a similar calculation, it holds that 


BO:)- pO) {(~ 2Pc(81)) F(x} - { (I~ 2Pe(62)) F(@2)} 


\|P(91) - P(92)|lin IIF(61) - Fa) \|in 


By observing that 


(I- 2Pc(6,)) F(@:) = f@i) 2( F(6) -¢ -#(@)) FG 


FO) ~ elm IF) ~ elm 
= f(6) - & log |if(@,) - cl (F(0) -c), 
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we may write 


J2= 


1 
(FG) - F@DIz 
« |F061)- 02) { F061) - (59- lowli(@1)~ el ) (F061) - 6) } 


{F(62) - (3 log lif(0s) - cle) (F062) -¢)}| 


_ 1 
IIF(61) - Fa) ln 


(a3 : gst) ~ee){t0)-€)-H) 

+ ( 5510s (G2) ~ els) { (7062) ~ ) -F6.)} 

-( 53; low iF) -e2») ( 53- Tow iF) ~ el 
x {(f(61) - ¢) - (f(@2) -¢) 

Using the facts that 


ieee SAC (02) + f(O2) - €) - f(@2) 
= 5 595 (-F(O1) ~F(6) ibe + Fr) ~ l2e) 


and 
oe ae f(01) + f(01) -¢) - f(61) 
= 5 gy (-IF COs) ~ F062) ibe + Fs) - lRe) 


we atrive at 


(f(61) -c) -f(@2) 
\IF(O1) — F(02)||2, 


1 Oo 


= 8 log ||f(01) - F(@2)|lin + 


(f(82) -c) -f(@1) 
\IF(O1) — F(02)||2, 


: 5 sae log ||f(81) ~ f (02) IIx + 


1 ||f(@2)-ellgm» 0 
2 \|F(61) - F(O2)\|5n 08 


| los |IF(G2) - C\lgn, 


1 |[f(@1)-ellg» 0 
2 \|F(61) - F(O2)||5n 08 


~ log IF.) - Clan. 
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Thus, our previous expression for J2 becomes 


== 3 (33: log ||F(@1) - elie) (59; loeli(@.) - (02) 


-$(s95 log |If(61) - f( 6220) ( sz Toe if) ~ el 


(2 veto et) (3; loelif(02) - el 


 |FO2) - Cll Zn + ||F(01) — clln — 2(F (1) — ¢) - (F (2) - ¢) 
IIF(01) — F(O2)|l5n 


--5 (39, lonli(@1)- el ) (59, lonlf(@.) - (02) 


mee 5 (se, log ||f(01) - F(62)I% (2. log ||f(62) - cl 


1 


+ (59; 108 0.) ~ ele) (55 low iF.) ~ el). 


Finally, we have that 
< 6) 
39, 108 LP (61) - PC)» = 55 (log f(x) - F(62) i» — log | f(6) - ella»), 


and therefore 


J3 


(59; loulit@1)- el) ( 55- louli*@1) - 162) 12e) 
+5 (39; losin) -F002) Ike) (59, lonlf(@2)- ele) 
1 
2 


53; 108 UF(61)~ el) ( 53- low f(x) ~ ele 


Corollary 3.4.2. Let f ¢ H'+(R/27Z). Then it holds that 
Ex(f) + En(f) = E1(p) + En(p). 


Remark 3.4.2. This corollary does not exclude that the case that both sides are infi- 
nite. 


Proof. First, let f ¢ H 2 (R/27Z). If the 2-dimensional Lebesgue measure of 
{(01, 62) € (R/2nZ)? | (01) = f(02)} 
is positive, then so is that of 


{(01, 62) € (R/27Z)* | p(61) = p(2)}. 
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Therefore, €(f) = €(p) = ce, and we have that 
Ei(f) + E2(f) = E1(p) + E2(p) = 09 
Thus, we assume that the 2-dimensional Lebesgue measure of 
{(01, 82) € (R/2nZ)’ | f(1) = f(82)} 
is O, and we claim that the measure of 
{(@1, 82) € (R/2Z)? | F(1) = F(2) or f(O1) = € or f(82) = c} 


is also O. In order to see this, considering f as a function of s on R/LZ, we need to 
prove that 
S={s € R/LZ| f(s) = c} 


is a finite set. To argue by contradiction, we suppose that S is not a finite set. Then, by 
the compactness of R/ LZ, there exists a sequence such that f(s j) =cand lim s j= Se 
jroo 


From the fact that 
Sj+1 Sj+1 
0= Ufl5j.1)-F06 Ibe = | f (8)-1(8)) sds 
sj Sj 
Sj+1 Sj+1 
= (Sj. - 5)" - 5 / \It(s) - t(s )||2ndsds , 
Sj sj 
it follows that 
Sj+1 Sj+1 
a \|t(s) - In(s) = 71S lie dé ay lIt(s) - TS )Ilam gee’ 
(s-s')? (Sju1 — $))? 
Sj Sj Sj Sj 


However, considering the fact that tT ¢ H 2 (R/£Z) and the absolute continuity of in- 
tegral, this implies that 


Sjsa Sjut N19 
2< tim [ | 7(s) - TS Ile» aeds’ = 0 


jee (s - s’)? 


which is obviously a contradiction. By these arguments, we find that (3.4.8) holds for 
L?-a.e. (6;, 82), and the desired conclusion follows by integrating this expression. 
Finally, we consider the case that f ¢ H 3 (R/27Z), which implies that €(f) = co 
Then, we will show that €(p) = oe. Again, arguing by contradiction, we suppose that 
E(p) < co from which we know that p does not Haye self-intersections. Furthermore, it 


holds that p € H;., and we remark that p € H? if Pp does not pass through the point 


loc? 
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at infinity. If we turn p back by inversion with respect to the sphere with center c and 
radius r, then it returns to f. Because f does not pass through the point at infinity, p 
does not pass through c. Therefore the 2-dimensional Lebesgue measure of 


{(@1, 82) | p(@1) = p(@2), or p(@,) = c or p(O2) = c} 


is 0. This implies that (3.4.8) holds for Y?-a.e. (91, 02), and by integrating this, we 
obtain that 
Ex (ff) + Ex(f) = E1(p) + Exp) = E(p) - 4 < o@. 


However, from this we obtain that co = E(f) = €,(f)+ €2(f) +4 < cv, whichis obviously 
a contradiction. We can conclude that it holds that €(p) = o<, and so €1(f) + E2(f) = 
E1(p) + E2(p) = ov, as desired. 


We discuss the invariance of each energy €; under the inversion f > p. 


Theorem 3.4.7. Assume that the center c of the inversion is not in the image of f. We 
also assume that the energy €(f) is finite. Then, we have that 


E1(f) =E1(p),  En(f) = En(p). 


Proof. In view of Corollary 3.4.2, it is sufficient to prove that €1(f) = €1(p). Let J; and 
J2 be defined as in the proof of Theorem 3.4.6. It follows that 


MAFF (O1) hie F(a) Ihe» -— Zi (P)||B(O1) hie ||B(O2)|Iie» = Ja — Jo = -J2. 


We need to prove that the integration of this goes to 0. From this point on, we use the 
arc-length variable s;. 

Because we are assuming that €(f) < co, we know that it holds that .“(f) ¢€ 
L'((R/£LZ)). Becuse .%(p) = 0, we may write 


. Ja 
E -€ =- | = : ds,ds>, 
Mose aa I FOdielF@dIe 


where |s; - S2| 2 € isin the sense of mod L. We remark that 


1 fe) _ 0 Sa 
FG )Im 28;°° 2 sj" 
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so we obtain that 


— lim ai = Jo D ds,ds2 
ee eS f(61) |e F(A 2) IIe 


= im, ff {( 52 loeliren)- el) (52 tos itn) - 052) 


|s1-S2|Ze 


(2 losis - flsa)ie) (5 2 Jog lif(s2) - cl 


“(es log f(s) ~ el (2 If) ~ cle) bass, 


Sit+L-€ 


) 
= lim, + aE Sar log iftss)- ele fs toglists.) ~F(s2)ledseds, 
Sit+€ 
SothL-€ 3 
+ lim 5 fa joy Hoult (62) -ele [50 log if (ss) -F(62)IRedsidsy 
S2+€ 


L 
_1 re) ani ) mn i 
; / 52 loglif(s1) - ends, / 575 108 Ifls2) ~ellRads>. 
0 0) 


Furthermore, it holds that 
SitL-€ 


rs) 2 2 = IIf(s1) - F(s1 - ©) |lin 
95y OS IlF(s1) Ff (S2)|lizndsz = log If) = FSi + 2, 


SitE 
Moreover, the fact that €(f) < co implies that f ¢ H 2 (R/£Z), and thus 
llf(s1) - f(s1 + €)|l2n = €7 + o(e?) 


uniformly with regard to s, as € — +0. Therefore, we have that 
si+L-€ 
~ log |[f(s1) - f(S2)||2nds2 = log(1 + 0(1)) = 0(4) 
site 
holds uniformly with regard to s;. From this, and the fact that f does not pass through 
c, it holds that 


L Si+L-€ 
F) F) 
/ <2 tog lif(s1) ~ll2e / <° tog lif(s1) — Fls2)|[ndszds, 
0S} OS? 
10) Sité 


ds, 


L 
- o(1) / ag; I fsx) el 
10) 


= 0(1). 
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Then, we arrive at 


Sit+L-€ 


L 
3 ms $5 3 Pe : 
lim, i. so log if (si) - els | jap 108 If (ss) Flsa))iBedsads1 = 0 
0) 


S{+E 
Similarly, we find that 


Sot+hL-€ 


& 
3 Ase 3 Lp eG 7 
Jim, [ 52-log\f(sa)~elzn [52 log ifts1) ~F(s2)2edsids2 = 0 
0 


Ste 


Finally, because f does not pass through c, we have that 


L L 
fe) 0 
/ hog Iif(s1) - elf3edsy | tog Iif(s2) - ellndsy = 0 
OS, 00> 
(@) 0 


from which the desired conclusion €;(f) = €(p) follows. 


Assume that Imf = S' and c € Imf. Then, p is a straight line. Therefore €,(f) = 27 
and €;(p) = 0. This shows that €;(f) = €1(p) does not hold unless c ¢ Imf. In fact, 
we have the following result. 


Theorem 3.4.8 ((12]). Let f ¢ Cl1(R/£Z). Then, for an inversion with center c on 
Im f, we have that 


E1(f) = €1(p) +277, En(f) = En(p) - 277”. 


Proof. We only provide a sketch of proof here. We refer the reader interested in the 
details to [12]. Let @ be a general parameter, as above. Under our assumption, f has no 
self-intersections, and so f passes through the center c only once at most per period. 
Thus, we may assume that c = f(0). Then, if @ #0 (mod 27), it holds that f(@) 4 c. 
As shown in [10], we have that 


AA PIF Ox)» IF)» ~ -@s(P)|IBO1) hell B(O)lie» = FIC, 82), B49) 


where 


51(61, 62) = (<3: log ||f(01) - FlO)|e (59; losli(@.) - 162) 
(2 log ||f(01) - fe IR») (se. log If(0) - F(O)|e 


: ( 2 me 000-0) (2g -f01h) 
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Because 
) %) 
30,0 7481 = 556-7) asi, 
it is easy to see that 
(01, 02)d0,d02 = / | J(81, 82) ds1dso, (3.4.10) 
(R/2nZ)2 (R/LZ)2 


where 
F (61.82) = (53 log ifs) - ODI ) (55: low f(s) - F162) 
+ (<2 los irtss)-f182)h%e)) (52, low Ufsa)- FOR} 
- (2 ese FlOD|Re (2 tose F(O)|e 


Here, we set s = 0 at 9 = O. We introduce a function ¥ replacing the Euclidean 
distance || - ||gn in .Y with the intrinsic distance 7: 


P8112) = (52 low (F181), £0)? ) ( 52 low (F161). F182)? 
2 (sos Alf(s1), F(S2)) ?) (Sloe 21F(2), flO) 2) 
- (2 log 2(F(s1), FO) | (53 log 9 Aft). FO)? ). 


We can show that 


i) F(S1, 82) ds,ds> = 4n (3.4.11) 
(R/LZ)2 
and 
i (£(s1, $2) - F(s1, 89) dsydsy = 0. (3.4.12) 


(R/£Z)? 


The assertion of Theorem easily follows from (3.4.9)-(3.4.12). There, we have used the 
existence of a bounded x almost everywhere, so we must assume that f € C11(R/LZ). 


Corollary 3.4.3. Right circles are the only global minimizers of €, in the class 
C1 1(R/LZ). 


Proof. Because .4%(p) = 0, it is clear that €;(p) = 0. Hence, €,(f) = €1(p) + 2m? = 
2n*. Furthermore, €,(f) = 277 holds if and only if €;(p) = 0, which is equivalent to 
(p) = 0. This implies that p(s) is a constant vector. That is, p is a straight line. 
Consequently, the pre-image f is a right circle. 
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As stated above, the lower bound of €; in C!!(R/£Z) is 277. This estimate seems to be 
related to the first eigenvalue of (-A;)?/*, which is the principal part of the L?-gradient, 


3 
is (=) . Indeed, by the bi-Lipschitz continuity of f, we have that 


2 
E1(f) =5 ‘ |AT lien ds,ds> ~ Ao sf Af llign dsids> 


IAF en 2 |As|? 
(R/£Z)? (R/LZ)2 
wn / (-45)""f" 2nds = 7 / I(-a3)>!F nds 
R/LZ R/LZ 
27m é 
=n | f--as?fds2 (22) J Wibeds. 
R/LZ R/LZ 


The equality in the above estimate implies that each component of f is the first eigen- 
function. This means that f is a right circle with the center at the origin and perimeter 
£. Therefore, 


_& 
Ifllan = 5. 


Consequently, we ontain that 


3 3 2 

21 2 __ (21 fF eee 

n(2Z) | Witeas - 2(22) & (+) =2n 
R/LZ 


The above is not a rigorous argument, but seems to the author to be of interest. 
It is still an open problem whether Theorem 3.4.8 holds for f € H?/2(R/£LZ) nN 
W?°(R/LZ). However we know a (non-sharp) lower estimate of €1. 


Proposition 3.4.2. We have that 


1 
1-cosu 


1 


for any f €¢ H?/2(R/LZ) Nn Wh (R/LZ). 


Remark 3.4.3. Note that the estimate for curves of C!1(IR/£Z) can be written as 


co 


1-cosu 


éx(f) = 2n? = an f 1688 du, 


-co 


Proof. Because the Euclidean distance cannot be longer than the intrinsic distance, 
we have that 


WatlBe Ariz 
PS wi 2 aFliz, 14 2 I] ZFS), Flea? 251482" 


R/L£Z)? 
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We insert the Fourier series 


f= >> prax 


keZ 


into the above. In a similar manner to derive the L*-gradient, we can obtain 


m\k| 
ATI 2nk|? / 1-cosu 
ds,ds) = a||en —— du. 
|] sagea.teae D|Z_| laelle we 
(R/LZP c -\k| 
By applying Parseval’s identity to 
2nik 
T=) PAs 
keZ 
we obtain that 
2nk|? 
I lli2@epez) = aig I|ayllen- 
keZ 
On the other hand, because 7 is a unit vector, it holds that 
IIT lliaceycz =F: 
Therefore it holds that 
2nk|? 
S- oa llaxllex = £. 
keZ 
Consequently, we arrive at 
n\k| fn : 
2nk|? 1-cosu 2m [1-cosu 2m 2 
= fenced 
SOAR llaniee f HSE au = Sf SY tu A al 
keZ _alk| a keZ 
1 
= 2n | 1- te u d 
u 


From the above proof, we can see that the given estimate is not sharp. The author 
expects that the lower bound of €, in H?/?(R/LZ) nN W!*(R/LZ) is also 272, and 
that right circles are the only minimizers. Blatt [3] helpfully informed the author that 
the answer may be affirmatively. 
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Henrik Schumacher 
Pseudogradient Flows of Geometric Energies 


Abstract: For differentiable functions on Riemannian manifolds, the gradient vector 
field and its induced gradient flow are well-defined and well-understood concepts. Un- 
fortunately, many nonquadratic, infinite-dimensional optimization problems cannot 
be formulated on Riemannian manifolds in a convenient way. We introduce a category 
of infinite-dimensional Banach manifolds that allows us to define a generalized gradi- 
ent. Moreover, we show short-time existence of the induced flows and apply their dis- 
cretizations to the numerical minimization of various geometric energies of immersed 
curves and surfaces. 


Keywords: gradient flow, minimal surfaces, Euler elastica, Willmore energy 


MSC: 49Q05, 49Q10 


4.1 Introduction 


Gradient flows are a popular tool for minimizing (or maximizing) continuously differ- 
entiable functions, both theoretically and numerically. Algorithms derived from gra- 
dient flows, often subsumed as methods of steepest descent, belong to virtually every 
library for (smooth) numerical optimization. The idea behind gradient flows is very 
intuitive and appealing: 

Let (M, g) be a Riemannian manifold, F: M —> R bea function of class C1, and 
a € Msome point. A tangent vector u € TaM of length r > 0 is called a direction of 
steepest descent of F at the point a <€ M of length r if it minimizes the slope S(u) := 
R(u)'(dF|a, u) among all tangent vectors of length r, where R(u) := g(u, u)?. That 
means, u is a solution of the following constraint optimization problem: 


Minimize S(u) over u € TgM subject to the constraint R(u) = r. (4.1.1) 


By introducing a scalar Lagrange multiplier A € IR, we have to solve the system 


R(u) r. 


oe = 0 forallvc TeM, 


This system is equivalent to 


1? (dFla, v) — (dFla, u)g(u,v)+Ar*g(u,v) = O  forallve TaM, 
g(u, u) r. 


Testing with v = u leads to A = O. Afterwards, testing with all v <¢ ker(dF|q) 
leads to u € ker(dF\q)+. Thus, the system is solved by u = o grad(F)|a with o € 
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{-r|dFla|z', 7 |dFla|z' }. Here, the gradient grad(F)|a € TaM is the vector defined by 
the equation 


g(grad(F)|q, v) := (dFla, Vv) for allv € TgM. (4.1.2) 


Moreover, |dF|q|g is the Frobenius norm of the linear form dF|q4: TaM — R with re- 
spect to the inner product g. Summarized, the direction of steepest descent of length 
ris given by 


u=-r|dFlalg’ grad(F)|a. 
Usually, one chooses r = |dF|a|g and defines the direction of steepest descent by 
u =-grad(F)|a. 


We point out a simple but often forgotten fact: One needs a metric d on TaM in order 
to be able to talk about the slope of a linear function €: TagM — R in direction u, the 
slope being given by 
(5, u) ~ (8, 0) 
d(u,0) ~ 

In particular, the notion of steepest descent requires the specification of a metric, at 
least on tangent spaces; different metrics will lead to different directions of steepest 
descent. There is no such thing as a direction of steepest descent per se. 

A flow @: Ux I Cc MxR — Mis called a downward gradient flow of F if it is 
generated by the vector field — grad(F), i.e., if it satisfies 


(a, t)=-grad(F)|ga,y and #(a,0)=a forall(a,t)¢UxlI. (4.1.3) 


Analogously, an upward gradient flow of F is generated by grad(F). Under mild regu- 
larity assumptions on F (e.g., dF is locally Lipschitz continuous), both the downward 
and upward flow exist at least locally and for short times. Moreover, each of the flows 
is unique. 

This theory carries over to infinite-dimensional manifolds as far as only Rieman- 
nian/Hilbert manifolds are considered. Unfortunately, many interesting variational 
problems can only be formulated as differentiable variational problems on Hilbert 
manifolds if one shrinks the feasible set to an extent that minimizers of the actual 
problem of interest might be excluded. Many geometric variational problems involv- 
ing nonquadratic energies and nonlinear constraints belong to this class; we discuss 
several examples in Section 4.5.1 and Section 4.5.2. The essential reason for this phe- 
nomenon is that the Sobolev spaces W*’*(R”;R) do not embed continuously into 
W?(R"; IR) for moderate n = 2 and smalls ¢€ [0,1 + 4]. Thus, a generic continuous 
functional on W1°(R”; R) can only be made continuous on W*’?(IR"; R) by choosing 
s overly large. 

This is why the generalization of gradients and gradient flows to more general 
infinite-dimensional manifolds is a quite delicate business. As far as we know, there 
have been the following two main approaches so far: 
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1. One may generalize gradients as directions of steepest descent in the sense of 
(4.1.1), using R(u) = ||ul|r,4 and r = ||dF\a||7, wy. If the Banach space TaM is re- 
flexive, this leads again to a well-defined gradient grad(F)|q € TaM. However, this 
gradient depends nonlinearly on dF|q. In practice, this means that a costly mini- 
mization or root finding algorithm has to be used in order to determine grad(F)|a. 
Moreover, even more costly algorithms have to be applied ifthe norm ||-|| 7,4 is not 
differentiable on TaM \ {0}. For more details on this kind of gradient flows and 
for generalizations to more general metric spaces, we refer the reader to [1]. 

2. In some cases, one may exploit a Gelfand triple Tg3M > Ha © (TaM), ie., ase- 
quence of continuous, linear, and dense embeddings, where (Ha, ga) is a Hilbert 
space. Then one may use (4.1.2) to define a “gradient” grad" (F)|q € Hg. Since 
grad” (F)|a ¢ TaM is not a tangent vector, the mapping a +> grad” (F)|a cannot 
be interpreted as a vector field so that (4.1.3) is no longer an ordinary differen- 
tial equation. Under certain conditions on dF, (4.1.3) may still be formulated as a 
parabolic partial differential equation; existence and uniqueness of the “gradient 
flow” can be ensured for short positive times t € ]0, e[. Prominent examples are 
the mean curvature flow (the L?-gradient flow of the surface area functional, see 
[5]) and the Willmore flow (the L?-gradient flow of the Willmore functional). An 
extensive overview and further literature on these flows can be found in [7]. 


In Section 4.4, we propose another way to define (pseudo)gradients for certain 
classes of Banach manifolds. In contrast to the first of the two approaches mentioned 
above, the pseudogradient grad(F) depends linearly on dF. In constrast to the second 
approach, these pseudogradients form indeed a vector field grad(F) and this vector 
field has the same regularity as dF. If F is of class Ce: then the pseudogradient flow 
defined by (4.1.3) exists locally and for small times, both in forward and in backward 
direction (see 4.4.2). Of course, one cannot expect to obtain such a strong result for 
arbitrary Banach manifolds and arbitrary functions. It turns out that the suitable cat- 
egory of manifolds to consider is the category of Riesz manifolds which we introduce 
in Section 4.3. We will also see there that the concept of Riesz manifolds is a bit richer 
than the concept of Riemannian manifolds. 

We aim at treating pseudogradient flows for quite general classes of infinite- 
dimensional Banach manifolds—and not only for open sets in Banach spaces. Such 
situations arise, e.g., when treating optimization problems with nonlinear constraints. 
The class of examples we have primarily in mind is the minimization of knot energies 
among knots in parameterization by arclength. The treatment of infinite-dimensional 
Banach manifolds will inevitably involve certain infinite-dimensional vector bundles 
so that we require the notion of a Banach bundle. This is why we give a very brief 
introduction to fiber bundles in general and to Banach bundles in particular (see Sec- 
tion 4.2). 

Section 4.5 is devoted to applications of the developed theory. We consider vol- 
ume functionals (Section 4.5.1) and certain functionals depending on extrinsic cur- 
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vature (Section 4.5.2) and show that these functionals together with suitably chosen 
domains belong to the category of Riesz manifolds (4.5.1 and 4.5.5). As a side effect, 
we verify that the category of Riesz manifolds (i) is strictly larger than the category of 
Riemannian manifolds and (ii) contains many examples of practical relevance. More- 
over, we give several numerical examples of discretized pseudogradient descent for 
the minimization of these energies subject to various constraints. In particular, these 
examples will demonstrate the efficiency and robustness of pseudogradient methods 
when applied to numerical minimization of geometric energies subject to nonlinear 
equality constraints. 


4.2 Banach Bundles 


Fiber bundles of various types and in particular vector bundles are typical working 
horses in algebraic topology, differential geometry, and global analysis. Since the 
reader might not be familiar with the notion of a fiber bundle, we give a brief intro- 
duction, being well aware that a thorough treatment of this matter may easily fill a 
full-semester master course. A beautiful and short introduction to Banach manifolds 
and vector bundles can be found in [13]. Moreover, we refer to [12], maybe the classi- 
cal text on fiber bundles, and to [14], a rather modern exposition which also covers 
great parts of the theory of partial differential equations for fields with values in fiber 
bundles. 


4.2.1 General Fiber Bundles 


The central idea of a fiber bundle is that it looks locally like a product of a base space 
and a fiber. The following definitions make this more precise. 


Definition 4.2.1. Let M, Eand X be topological spaces, let 7: E — Mbeacontinuous, 
surjective map, and let U c M be an open set. We write E|U := 1 1(U) C E. A local 
trivialization of m: E —> M on U with typical fiber X is ahomeomorphism @: E|U > 
U x X such that the following diagram commutes 


E|U —__* __, Ux xX 


ee 


Here, pr: Ux X > U, (a, x) + ais the projection onto the first factor. 


Definition 4.2.2. Let M, E and X be topological spaces and let 7: E — M be acon- 
tinuous, surjective map. A fiber bundle atlas on 1: E — M with typical fiber X isa 
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covering family of local trivializations (g;: E|U; + Uj; x X)jc; in the sense that (U,);<; 
is an open covering of M. 

Two fiber bundle atlantes with typical fiber X are called compatible if their union 
is a fiber bundle atlas. A fiber bundle atlas is called maximal if it contains each com- 
patible fiber bundle atlas. 

A fiber bundle with typical fiber X is a continuous and surjective mapping 7: E > 
M together with a maximal fiber bundle atlas with typical fiber X. With a slight abuse of 
notation, one usually omits mentioning the precise atlas and simply refers to 7: E > 
M or even to E as the fiber bundle. One refers to E as the total space, to M as the base 
space, and to 7 as the bundle projection of the fiber bundle 7: E — M. Fora € M, one 
calls Eq := 1 *({ a }) the fiber over a. Note that Eq is homeomorphic to X. 

For each pair i, j ¢ I with Uj := U; 9 U; 4 0, the following commutative diagram 
defines a homeomorphism 9%, : Uj; x X —> Uj x X, the so-called transition map: 


PilE\uy PilE|U,, 
Uy xX < E|Uj; > Uy x X 
| (4.2.1) 


Pr; 


Pr; 


Note that one has 8; = (@;;)"* and Gj 0 Dix = Dix. 

Let Re {ce |k E NU {0, cc}, a € [0, 1] withk+a21}.If Mand X are Banach 
manifolds of class R and if each transition @" is of class R, we say that 7: E— Misa 
fiber bundle of class R. Note that in this case, the total space E is a manifold of class R 


and 7 is asubmersion of class R. 


Example 4.2.3. Let M and X be topological spaces. Then E := M x X together with 
the projection 7: E > M, (a, f)+> aisa fiber bundle with typical fiber X. Such fiber 
bundles are called trivial, since there exists a global trivialization. By definition, every 
fiber bundle restricted to a sufficiently small set is a trivial fiber bundle. 


Example 4.2.4. Let E c R? be the embedded Mobius strip depicted in Figure 4.1. Let 
M c Ebe the centerline of the Mobius strip and 7: E + M be the shortest distance 
projection with respect to the intrinsic distance of E. Then 2: E > M is a nontrivial 
smooth fiber bundle with a closed interval as typical fiber. 


Example 4.2.5. Let 7,: E; ~ Mand 7: E, — M be fiber bundles with typical fibers 
X, and X> respectively. Then the product bundle m: E; xj E2 — M defined by 


Ey xy E2 = { (1, €2) € Ey x Ez | m1 (e1) = m2(e2)} and 


mt: E, xy E2 > M, m(e1, €2) = m1(e1) = m2(e2). 
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Fig. 4.1: An embedded Mobius strip as nontrivial fiber bundle over its centerline (white). Several 
fibers are depicted in orange. 


is a fiber bundle with typical fiber X; x Xp. 


Definition 4.2.6. Let 7: E — M bea fiber bundle with typical fiber X. A mapping 
o: M > Eiscalled a section of the bundle E if it satisfies 110 0 = idy, i.e., o maps each 
point a € M into its fiber Eg. If both 7: E — Mando: M — E are of class R, we say 
that o is a section of class R. 


Definition 4.2.7. Let 7,: E, — M, and 7,: Ex — My) be fiber bundles and let 
F: E, — E2 and f: M, — Mp be continuous maps. We say that F is a fiber bundle 
morphism from E, to E> over f if the following diagram is commutative: 


| eee oF 


m| [x 


M, ae ee 


Thus one has well-defined and continuous fiber mappings Fa := F\(g,),: (Eva > 
(E2)(q) for each a € Mj. We say that F: E; — E2 is an isomorphism of fiber bundles 
over f, if both f and F are homeomorhisms. In that case, the inverse F “1. E, > E;is 
a fiber bundle morphism over f~'. 
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4.2.2 Banach Bundles and Hilbert Bundles 


Note that the transition maps $j; = Ujj x X > Uj x X ofa fiber bundle 7: E > M are 
of the special form 


Pi(a, x) = (a, Yi(@M) 


with some transition function j;: Uj; — Homeo(X) into the group of homeomor- 
phisms of X. Moreover, one has (p;(a))"* = pji(a) and 9jj(a)pjx(a) = Pjx(Q). 

One may define various special types of fiber bundles in the following way: Let 
X be a space with an additional structure that is compatible with the topology of X. 
For example, X can be a topological vector space, a topological group, or a metric 
space. Denote by G c Homeo(X) the group of continuous isomorphisms of X, i.e., 
those homeomorphisms of X that also respect the additional structure. By adding the 
further requirement to a bundle atlas that its transition functions take only values in 
this group G, one may define fiber bundles whose fibers exhibit the same structure as 
its typical fiber X. In the following, we outline this construction for the special case 
that the typical fiber X is a Banach space (or Hilbert space), leading to the notion ofa 
Banach bundle (or Hilbert bundle, respectively). 


Definition 4.2.8. Let X be a Banach space and let 7: E > M bea fiber bundle with 
typical fiber X. A Banach bundle atlas on m: E + M with typical fiber X is a covering 
family of local trivializations (g;: E|U; - U;xX),-; such that each transition function 
is a continuous map 9: Uj; + L(X; X), where L(X; X) denotes the Banach algebra of 
continuous linear operators from X to X. 

A fiber bundle 7: E — M with typical fiber X is called a Banach bundle with typical 
fiber X if its maximal bundle atlas of 7: E — M contains a (maximal) Banach bundle 
atlas. 

Analogously, one defines the notion of a Hilbert bundle. The definitions are such 
that each Hilbert bundle is also a Banach bundle and that each Banach bundle is also 
a fiber bundle. 


Proposition 4.2.9. Let 1: E — M be a Banach bundle. Then each Eq, a € M carries 
the structure of a topological vector space in the following way: Choose a local trivial- 
ization p: E|U > U x X froma Banach bundle atlas of E such that a € U. Then define 
forv,v1,V2 € EaandA ec R: 


Vi (+a V2 = @ (a, pry(P(V1)) + pry(p(v2))) and A(-)av:= yp '(a,A- pry(p(v))). 
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By the special structure of Banach bundle atlantes (see 4.2.8), this is independent of the 
choice of ~ and the induced mappings" 


+:ExyE-E, Vi +V2 = V1(+)aiy,,v,)¥2 and 
-:RxE-E, Avve=AQ)ayv 


are continuous. 


Example 4.2.10. Let M bea Banach manifold of class C*, k+a > 2. Then the tangent 


loc? 
bundle 1;y: TM — MisaBanach bundle of class C1'*. Sections of TM are precisely 
the vector fields on M. 


loc 


Example 4.2.11. Let 7x: X — Mand my: Y > M be Banach bundles over M. Then 
the space L(X;Y) := Laem {a} * L(Xa; Ya) together with Tx.) ! L(X;Y) — M, 
(a, A) + ais a Banach bundle in a canonical way; a Banach bundle atlas for L(X; Y) 
can be constructed from bundle atlantes for X and Y. 

In particular, the dual bundle X "+= L(X; MR) is also a Banach bundle with fibers 
(X)a & (Xa) foralla c M. 


Definition 4.2.12. Let 7,: E, — M, and 7,: Ez — M> be Banach bundles, and let 
F: E, — E> bea fiber bundle morphism over f : M; — Mp (see 4.2.7). We say that F is 
a Banach bundle morphism from E, to E2 over f if for each a € M;, the fiber mapping 
Fa = F\(g,),? (E1)a — (E2) pq) induced by F is linear. We say that F: E, — E is aniso- 
morphism of Banach bundles over f, if it is also a fiber bundle isomorphism. Note that 
in this case, the inverse F-!: E, — E, is automatically a Banach bundle morphism 
over ft. 


4.3 Riesz Structures 


First, we introduce Riesz structures on Banach spaces (Section 4.3.1). To some extent, 
Banach spaces with Riesz structures look and feel quite much like Hilbert spaces. For 
example, 4.3.5 introduces the concept of a pseudoadjoint A” € L(X;X,) of a Riesz 
morphism A € L(X,;X2) (a concept which we utilize to introduce pseudogradients 
in Section 4.4). The generalization of Riesz structures to Banach bundles is straight- 
forward. Hence, a great deal of Section 4.3.2 will occupy us with producing a prac- 
tically relevant example. Finally, we introduce the category of Riesz manifolds (Sec- 
tion 4.3.3), a category that carries many desirable properties of the category of Rie- 
mannian manifolds. 


4.1 see 4.2.5 for the definition of the product bundle 7: E xy E — M 
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4.3.1 Riesz Structures 


Definition 4.3.1. Let X be a Banach space. A Riesz structure on X consists of 
1. acontinuous, linear and dense injection i: X <> H into a Hilbert space H; 
2. acontinuous, linear and dense injection j: H — Y into a Banach space Y; 
3. a fixed Hilbert norm ||-||,; or equivalently, its Riesz isomorphism I: H > H’; 
4, and of an isomorphism of Banach spaces J: X — Y’ 

such that the following diagram commutes: 


 —— te 


(4.3.1) 


H —————__ > H. 


We may call J a pseudo-Riesz isomorphism of X. Moreover, every Banach space X with 
Riesz structure (i, j, I, J) exhibits a pre-Hilbert metric, which can be expressed by the 
Riesz structure in the following ways 


D(x, X2) = (1x1, 1X2) y = (ix, 1x2) = x, jix2), forall x1, x2 EX. 


Remark 1. At first glance, the notion of a Riesz structure seems to boil down to the 
notion of a Gelfand triple, i.e., a topological vector space X, a Hilbert space H together 
with linear, dense embeddings X <> H <= X’. However, this is not true, since Riesz 
structures involve a third Banach space Y which need not coincide with X’, in partic- 
ular if X is nonreflexive. 


The prototypical Riesz structure is given by the following example. 


Example 4.3.2. Let (2, A, 4) be a finite measure space, let p € [2,0], and let gq € 
[1, 2] be the Hélder conjugate of p. Put X := L?(Q, y), H := L?(Q, pw), Y = L4(2; py), and 
denote by i: L?(Q, np) 3 L?(Q, pw), j: L?(Q, w) + L4(Q, w) the canonical embeddings. 
The Riesz isomorphism I: L*(Q, uw) — (L?(Q, w)) is given by (Iv1, v2) = [vivo dy 
for v1, V2 € H. Analogously, one may consider the operator J: L?(Q, yw) > (L9(Q, y))’ 
defined by (Ju, w) = {,,uwdy for u € X and w ¢ Y. Observe that Io i =j oJ. Bythe 
Radon-Nikodym theorem, J is an isomorphism of Banach spaces, hence (i, j, I,J) isa 
Riesz structure on X = L?(Q, py). 


It is essential for our applications that Riesz structures are able to encode elliptic op- 
erators. 


Example 4.3.3. Let (X’, g) be a compact Riemannian manifold all whose connected 
components have nontrivial boundary, let p € [2,o0[, and let gq € ]1, 2] be the 
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Hélder conjugate of p. Let X = W7?(2;R™) 9 Wo?(2;R™), H s= Wir(5;R™), 
Y := L9(;R™) and let i: X — H andj: H © Y be the canonical embeddings. Let 
0 <c<C <cobe constants and let b € W!"(5; Sym?(TS)) with r > dim() be a ten- 
sor field of symmetric, bilinear forms with c g|a < bla < Cg\a for all a € »’. Thus, the 
Laplace-Beltrami operator —A,: W7?(";R™) + L?(2;R™”) with respect to the met- 
ric b is well-defined and continuous (see 4.5.3). This allows us to define the operator 
Jy: XY by 


Jp u, w) = J so-avu, w)vol,h, ucX,we Y, 
zy 


where vol, denotes the Riemannian volume density induced by b. Elliptic regular- 
ity for the Dirichlet problem (see Theorem 9.15 in [10]) implies that J, is an iso- 
morphism provided that r = p. Integration by parts leads to the weak formulation 
Ip: Wo'?(23R™) > (W’?(2;1R™)) of the Laplace-Beltrami operator: 


(Ip V1, V2) 2= [ian dv2), vol, forvy,v2 € Wo (2; R™). 
By) 


The condition c g < b < Cg guarantees that I, is both continuous and continuously 
invertible. The fact that I, and J; are connected via integration by parts is reflected in 
the equality I, oi = j’ oJ, showing that (i, j, I,, J,) constitutes a Riesz structure on X. 


Definition 4.3.4. Let (X1, i1, 71,4, J1) and (X2, in, jz, In, Jz) be Banach spaces with 
Riesz structure. We call a triple (A, B, C) with A € L(X,;X2), B € L(H;;H>), and C € 
L(Y1; Y2) a morphism of Riesz structures or shorter, a Riesz morphism, if the following 
diagram commutes: 


Proposition 4.3.5. Let (X1, i1, j1, 1, J1) and (Xo, iz, j2, In, J2) be Banach spaces with 
Riesz structure and let (A, B, C) be a Riesz morphism. We define the pseudoadjoint A* 
of A by A* := J; C Jz, where C € L(Y}; Y,) denotes the dual map given by C & := € oC. 
The pseudoadjoint has the following properties: 
1. A” € L(Xz;X}). 
2. A* isan adjoint with respect to the induced pre-Hilbert metrics b, and bp, i.e., one 
has 


b7(A x1, X2) = bi(x1, A’ x2) for allx, € X; and x2 € X2. 
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3. A*A is symmetric and positive semi-definite with respect to b,. 
4, AA* is symmetric and positive semi-definite with respect to bp. 


Proof, Claim 1. Since C’, J i and J> are continuous, A” is clearly continuous. 
Claim 2. Observe 


i A™ = (iT) C Jn = Ti" i CVn = Ty" B (22) = Ty" B Inn = Bi. 
With x; € X; and x2 € X2, the second statement follows from 
bi(x1, A'X2) = (i X1, BY in X2) 4, = (Bir X1, in X2)H, = (ig AX1, in X2) Hy = B2(A M1, X2). 
Claim 3. This follows from 
b,(u, A’ Av) = bo(Au, Av) 
and 
b,(u, A’ Au) = b2(Au, Au) = |liz Aull#, = 0 


for allu, v € Xj. 
Claim 4. One has b>(AA“u, v) = bi (A*u, A*v) and 


b,(AAu, u) = by(A“u, A“u) = (i, A’, i, A°U)y, = ||B’ in ullz, = 0 


for allu,v € X>. 


4.3.2 Riesz Bundle Structures 


Analogously to 4.3.1 and 4.3.4, we define Riesz structures on Banach bundles and Riesz 
bundle morphisms as follows. 


Definition 4.3.6. Let 7: X — M bea Banach bundle. A Riesz structure on X consists 
of 
1. acontinuous, linear, and dense bundle injection i: X — H over idy into a Hilbert 
bundle zy: H > M; 
2. acontinuous, linear, and dense bundle injection j: H — Y over id, into a further 
Banach bundle zy: Y > M; 
3. a fixed Hilbert bundle norm ||-||7 or equivalently, its Riesz isomorphism I: H > 
He 
4, and an isomorphism J: X — Y’ of Banach bundles over idjy 
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such that the following diagram commutes. 


[Ne 
ee 

Let R € fCRe |k € NU{0, co} 3, a € [0,1] with k + @ = 1 and suppose that 
am: X — Misa Banach bundle of class R. We say that a Riesz structure (i, j, I, J) is of 
class R, if 


1. Hand Y are Banach bundles of class R and 
2. i, j, I, and J are Banach bundle morphisms of class R. 


W2 


S. A 


(4.3.2) 


rt <_< 


Example 4.3.7. Let ©’ be a connected, compact, n-dimensional manifold with non- 
trivial boundary. Denote by go the Euclidean metric on R™. For f ¢ WP?(;R™), 
denote by f*go := go(df:,df-) the pullback of the bilinear form go and define the 
space of Lipschitz immersions as 


Imm?*(53;R™) = {f © Wh?(5;R™) | 5C>0: Clg < fig < Cg}, (4.3.3) 


where g may be any smooth Riemannian metric on ¥. Note that Imm»*(";R™) c 
w(x"; R”) is an open set. For p > n, the Sobolev embedding theorem shows that 
W?-P(y;IR™) embedds continuously into W!°(; R™) which enables us to define 


Imm”?(5; R™) = Imm?*(L;R") n W??(5;R™). 


The trace theorem for Sobolev spaces states that the so-called trace operator 
res: C°(0;R™) > C™°(0Y;R™), res(f) = flax can be continously extended to 
res: W2?(5;R™) > WwW?» (95; 1R™). We fix an immersion y €¢ Imm? P (9d; R™) 
and consider the configuration space of immersed surfaces 


= Imm??(2;R™) := {f € Imm??(5;R™) | res(f) = 7}. 
Analogously to 4.3.3, we define the (trivial) Banach bundles 
X = Cx (W??(y; R™)n Wo?(53 R™)), H i= Cx Wo??(2; R™), Y:=CxL9(5;R™) 


along with canonical injections i: X ~ H andj: H <— Y. These mappings are fiber- 
wise continuous and dense injections. We summarize the setting in the commutative 
diagram 


Cty aetoy 

bee 

Cc — > € —>C¢. 
ide ide 
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Each immersion f € @ induces a Riemannian metric b := f* go of Sobolev class W!”?.42 
Thus, the Laplace-Betrami operator A, := Ap: W2?(5;R™) > L?(2;R"”) is well- 
defined. With the definitions and results of 4.3.3, we oberve that Jy := Jp: X- — (Y if) 
and Ip = Ip: Hp (Hr) are isomorphisms satifying If 0 ir = if o Jr. Hence the data 
(i,j, I,J) constitutes a Riesz structure on the Banach bundle X. Without going into 
detail, we mention that (i, j, I, J) is even a Riesz structure of class C™. 


Definition 4.3.8. Let (X1,i1,/1,11,Ji,) and (Xo, in, jo, In, Jz, ) be Banach bundles 
with Riesz bundle structures. We call a triple (A, B, C) of Banach bundle morphisms 
A: X; — X>, B: H; — H>, and C: Y; > Y> a Riesz bundle morphism, if (Aa, Ba, Ca) 
is a Riesz morphism for each a € M, i.e., if the following diagram commutes: 


4.3.3 Riesz Manifolds 


Definition 4.3.9. Let k ¢ NU {co} anda é€ [0, 1] with k + @ = 2. A Riesz manifold of 
class C He is a Banach manifold M of class C ins together with a Riesz structure (i, j, I, J) 
of class C. on TM. 


Definition 4.3.10. Let (My, i1, 71,11, J1) and (Mg, ia, jz, In, Jz) be Riesz manifolds of 
class ce and let F: M; — M2 bea mapping of class ce. We say that F is a Riesz 
morphism of class C He if the tangent map TF: TM, — TM) induces a morphism of 
Riesz bundles of class eo In particular, for a Riesz morphism F: M — R into the 


real numbers, we also say that F is a Riesz function. 


Example 4.3.11. Let (M, g) be a Riemannian manifold, i.e., a (smooth) Banach man- 
ifold together with a (smooth) section g of Sym?(TM ) such that for each a € M, the 
symmetric bilinear form ga: TaM x TaM — R is positive-definite and generates the 
topology on TaM. In particular, (TaM, ga) is a Hilbert space and Ig: TaM > T,M, 
(Iqu, Vv) := ga(u, v) is a Riesz isomorphism. With TM = H = Y, and J = I, i =j = idry, 
we observe that (M, i, j, I, J) is a Riesz manifold in a natural way. Note that a smooth 
map between Riemannian manifolds induces also a (unique) Riesz morphism between 
the induced Riesz manifolds. 


4.2 Since f*go depends quadratically on df, the condition p > nin necessary here. 
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Example 4.3.12. Let (M, i, j, I, J) be a smooth Riesz manifold. Then one may define a 
section g of Sym?(TM) by 


alu, v) = Yau, jiv) =(Uaiu,iv) foru,v € TaM. 


Since Iq is a Riesz isomorphism, gq is a symmetric and positive-definite bilinear form. 
Thus, (TaM, ga) isa pre-Hilbert space with completion (Ha, (Ia-, -)). If TaM is not iso- 
morphic to a Hilbert space, (TM, g) is not a Riemannian manifold. In general, there is 
no chance to complete (M, g) to a Hilbert manifold (M, &). Moreover, in the rare cases 
where this is possible, many continuous (or even differentiable) functions on M can- 
not be extended continuously (differentiably) to M. 


These examples show that Riemannian manifolds form a (proper) full subcategory 
of the category of Riesz manifolds in the sense that the functor described in 4.3.11 
is injective (but not surjective) on objects and fully faithful. Still, the latter category 
shares many useful properties with the former (see Section 4.4 below for a few exam- 
ples). Forgetting the underlying Riesz structure induces a forgetful functor from the 
category of Riesz manifolds to the category of Banach manifolds. In this sense, Riesz 
manifolds may be considered as lying somewhere between Riemannian manifolds and 
arbitrary Banach manifolds. More precisely, we have the following commutative dia- 
gram of functors 


manifolds manifolds 


(M. ee eas DOM 


Banach 
manifolds [° 


Riemannian (M,g)>(M, id, id,I,D Riesz 


4.4 Pseudogradient Flow 


Now we have all necessary ingredients for defining the pseudogradient of a Riesz func- 
tion. 

Definition 4.4.1. Let (M, i, j, I, J) be a Riesz manifold of class C Nee andletF: M+R 
bea Riesz mapping of class Ce. We define the pseudogradient field grad(F): M — TM 
by setting grad(F) := (TF)’. 


Compared to the elaborate theory that is needed for the treatment of nonlinear flows of 
parabolic type, the existence theory of pseudogradient flows relies on quite elemen- 
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tary principles. We gather the central properties of pseudogradient fields and their 
flows in the following theorem. 


Theorem 4.4.2. Letk ¢ NU{co} anda «€ [0,1] withk + a2 2. 
Let M be a Riesz manifold of class ce and let F: M > R be a Riesz function of class 
C ae Then one has 

1. Foralla € M: (dF\a, grad(F)|a) = 0. 

2. Let a € M. Then grad(F)|a = 0 if and only if dF\a = 0. 

3. The pseudogradient field grad(F) is a vector field of class C. 

4, For each interior point ag € M \ OM there is a neighborhood U of ag and ane > 0 


such that there is a unique flow ®: U x |-€, €| > M with 
f(a, t)=-grad(F)\giq,y and P(a,0)=a forall(a,t) ¢ Ux]-e,¢[. 


5. For each interior point a € M \ OM, the function t > F o &(a, t) is monotonically 
decreasing. 


Proof. Denote the Riesz structure on M with 
i: TMGH, j:HOY, I:H+H and J: TM3Y 


and let B: H + TRandC: Y — TR be the Banach bundle morphisms over F of class 
CK} with ig TF = Biand jg ig TF = Cji. 
Claim 1. Denote by bp the Euclidean inner product on TR. By 4.3.5, we have for each 


a € Mthat 
(dFla, grad(F)|a) = be(1, TaF grad(F)|a- 1) = ||Ba- 177, > 0. 


Claim 2. Note that dF|q = 0 is equivalent to TF = 0. On the one hand, the definition 

of grad(F) implies immediately that grad(F)|q vanishes if TagF vanishes (see 4.4.1). On 

the other hand, the inequality above shows that grad(F)|a = Oimplies B; = 0 and thus 
a = O. We have (ip)aTaF = Baia = 0 and since (ig)q is an isomorphism, we obtain 

TaF = Oand hence dF\q = 0. 

Claim 3. With J also J~! is of class eos Moreover, continuous bilinear combina- 

tions of Banach-space valued mappings of class C K-14 ore again of class C 1.4 Hence 


loc loc 
grad(F) = J-! C isa section of class Co in the Banach bundle TM, hence a vector 
field of class C1", 


loc 
Claim 4. Note that a +> -grad(F)|q is a vector field at least of class C i Hence the 
statement follows from the Picard-Lindel6ff theorem. 
Claim 5. This follows from Claim 2 and from the chain rule: 


d 
qt oe t)) = —(dFloca,t)» grad F|a,q,1)) <0. 


If the initial condition is not a critical point, the function t + F(&(a, f)) is even stricly 
monotonically decreasing, since stationary points cannot be reached within finite 
time. 
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4.5 Applications 


Geometric functionals for curves and surfaces, such as those we discuss in this sec- 
tion, play a major role in the field of geometry processing and are applied to a variety 
of tasks. Since reducing computational time is a specific objective in this field, it is no 
wonder that the advantages of H- and H?-gradient flows have already been observed 
and utilized within this community (see [9]). Our aim here is to give some more theo- 
retical justification on these flows, even for the pre-discretized, infinite-dimensional 
setting, which is often not covered appropriately. 


4.5.1 Minimal Surfaces 


Let us return to the setting of 4.3.7. Observe that X = TC is precisely the tangent bun- 
dle of € = Imm?” (2;R™), the configuration space of immersions subject to Dirichlet 
boundary conditions. As we have seen in 4.3.7, the data (i, j, I, J) represents a smooth 
Riesz structure on the Banach bundle TC, hence (€, i, j, I, J) is a smooth Riesz mani- 
fold. Now consider the volume functional 


F:C oR, F(f/x [voles ‘ 
Zp 


Theorem 4.5.1. The volume functional F is a smooth Riesz function on the smooth Riesz 
manifold Imm7? (1; R™). 


Proof. One has 
Df +> volyeg,) u = (df, du) reg, Volpe e, (4.5.1) 


for each Lipschitz immersion f €¢ Imm’°(2;R™) and each u € T,Imm(2;R™) = 
W)(y;R™). Hence, the volume functional is differentiable and its differential 
dF: C > T Cis given by 
(dF |p, u) = [iar du) rg, VOlpeg, forue TC = W??(5;R™) 9 Wo? (2; R™). 
xy 


Extending dF|, continuously to +, leads to the smooth section B: C > +H given by 
(Br, V) = fw. dv) peg, VOlprg, forv € Hp = Wo'7(5;R™). 
By: 


Via integration by parts, we obtain 


(Ajo) = f g0l-A/F,w) volpg, 
ay 
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Since A,f € L?(;R™), this leads to the smooth section C: C > Yy’ given by 
(Cp, W) 2= J soar. W) Volrg, for w € Uy = L4(5;R™). 
xy 


Moreover, the family (C;)¢<e@ induces asmooth Banach bundle morphism C: Y > @xR 
and one has jp ip TF = Cji, hence F is a Riesz function on C. 


Combining 4.4.2 and 4.5.1 allows us to discuss the pseudogradient flow of the volume 
function F. For an immersion f € €, the defining equation for the pseudogradient 
u = grad S|, can be written as 


[(anavi re, Vol rig, = [iar AV) peg, VOlpeg,, forallv c Hy = Wo? (23 R™). 
7; x 
(4.5.2) 


Note that the weak formulation of the Laplace-Beltrami operator occurs on both sides 
of the equation. For numerical optimization, we have to discretize the configuration 
space @ and the objective function. To this end, we may fix a triangulation J of 1 
and define the discrete configuration space Cz as the set of all immersions that are 
piecewise-linear with respect to J and that restrict to y on all boundary vertices of J. 
This way, the image f() of an element f ¢ Cz is an immersed simplicial mesh whose 
boundary complex is inscribed into (0). A discrete volume function ¥5 can be de- 
fined straight-forwardly as the restriction of the volume function to Cy. Since Cz is fi- 
nite dimensional, it is a Riemannian manifold when equipped with the discretization 
of the weak formulation of the Laplace-Beltrami operator and the discretized pseudo- 
gradient u = grad(F-+s)|-¢ can be defined by 


[au Av) peg, VOlpeg, = (AF s/f, v) = fw. Av) tg, VOlpeg,, forall v € TCz. 
x xy 
(4.5.3) 


The pseudogradient descent algorithm consists now in choosing an initial guess fo € 
Cz and by computing recursively 


fri =fn-Tn grad(Fz)|p,, forn e Nu {0}, (4.5.4) 


where grad F5|, has to be computed from (4.5.3) and Tn > 0 is a step size parameter 
that has to be chosen appropriately. Again, the (discretized) Laplace-Beltrami operator 
occurs on both sides of the equation. This comes in handy, as it has to be reassembled 
in each iteration and this way, it can be used at least twice: once for computing dF5|, 
and once for solving for the pseudogradient. Since assembling the discrete Laplace- 
Beltrami operator is a standard task, the implementation is pleasantly easy. We point 
out that this algorithm has already been introduced in [15], however, from a different 
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Od 


Fig. 4.2: From left to right: initial surface (512 faces); surface after a single explicit Euler step in 
negative discrete mean curvature direction; surface after a single explicit Euler step in negative 
discrete pseudogradient direction. 


00 w 


Fig. 4.3: The same as in Figure 4.2 with the same step sizes, but with refined mesh (32 768 faces). 


perspective. The mentioned article is also a good reference for the assembly details of 
discrete Laplace-Beltrami operators on discrete surfaces. 

When Tp is chosen to be independent of n, (4.5.4) boils down to an explicit Euler 
scheme for the (discrete) pseudogradient flow. In principle, one may apply also more 
sophisticated step size rules for line search, such as the Armijo algorithm or the Wolfe- 
Powell algorithm. 

In the case of the area functional, it turns out that one can go quite well with con- 
stant step size of magnitude tT = 1 and this is essentially independent of the mesh 
resolution. In our experiments, we rarely needed more than two dozen pseudogradi- 
ent steps in order to reach a configuration from which Newton’s algorithm converged 
within several steps to a critical point—no matter if the discrete surface consisted of a 
few thousand ora million triangles. As Figure 4.4 suggest, the discrete pseudogradient 
flow performs well and is very robust even in cases where minimizers do not exist. 

These findings are in strong contrast to the step size rules for discrete mean cur- 
vature flow, the L?-gradient flow of the volume functional: As discrete mean curva- 
ture flow is a parabolic partial differential equation (in time and space), the step size 
has to be decreasing along with the mesh size, even if more sophisticated integration 
schemes such as (semi-)implicit methods are employed. As a comparison, we include 
Figure 4.2 and Figure 4.3, each depicting a step of discrete L?-gradient descent and 
discrete pseudogradient descent at different mesh resolutions. We can also see there 
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Fig. 4.4: Diverging pseudogradient flow of area functional in the absence of minimizers. It seems to 
converge to a minimizer in another topological class. 


that the L*-gradient is increasingly localized with raising L*-norm as the mesh size is 
decreased. Of course, the reason is that the initial surface is not twice differentiable so 
that its mean curvature is a vector-valued distribution with support concentrated on 
the creases. Hence the discrete mean curvature, trying to approximate this distribu- 
tion, blows up under mesh refinement. In order to obtain a stable algorithm, the step 
size has to be decreased accordingly. 

On the other hand, the information contained in the mean curvature distribution 
is spread out over the whole surface when solving for the pseudogradient.*? In par- 
ticular, points in medium distance to an “incident” get informed immediately during 
the next pseudogradient step so that they have the opportunity to “react” just-in-time. 
See also Figure 4.5 for a typical example of an “incident” that occurs quite frequently 
with discrete mean curvature flows but considerably less often with discrete pseu- 
dogradient flows. We point out that this behavior is primarily an artifact of the time 
discretization: Mean curvature flows with infinitesimal step size have infinite prop- 
agation speed and this infinite speed is hard to capture by a discrete time stepping 
algorithm. 


4.3 Note that the pseudo-Riesz isomorphism is still an elliptic operator such its inverse is smoothing. 
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Fig. 4.5: L?-gradient descent (first column) and pseudogradient descent (second column), both for 
the area functional, with explicit Euler scheme and constant step sizes. 
First row: initial surfaces; second row: after first step; third row: after nine steps. 
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4.5.2 Elasticae 


Curvature dependent energies such as the elastica functional can also be well- 
formulated on the space Imm?'?();R™), when p is suitably chosen. While the dis- 
cussion of the volume functional was dominated by the Laplacian operator, it turns 
out that a suitable operator for defining a Riesz structure for curvature energies is the 
bi-Laplacian. If p > dim(), the discussion is even simplified as we do not need to 
resort to the space Imm**?(; R™), where the strong formulation of the bi-Laplacian 
resides. 

Let X’ be an n-dimensional, compact, and connected smooth manifold. In order 
to keep the exposition as brief as possible, we focus our attention to manifolds with 
nontrivial smooth boundary and to the space 


Cs Imm??(; R™), pé€[2,e9[N]n, cof 
for a given y € Imm2->P (2; R™).44 From now on we use the Banach bundles 
TCS es (WSR in We (eeR™); 
H = Cx (W?7(2;R")n Wo(23R™)), and 
Y = Cx (W24(E;R™) 1 Wo'4(25R")), 


where q is the Hélder conjugate of p.*° Note that p is chosen such that the canonical 
injections i: X — H andj: H — Y are dense in each fiber. From 4.3.3, we recall that 
the Laplace-Beltrami operator Ay maps wW2"(5;R™) 9 Wo" (23 R™) isomorphically to 
L'(7;R™) for r € ]1, p] such that we obtain a Riesz structure (i, j, I, J) with the iso- 
morphisms 


TI: H-> KH, (pv; V2) = J sola, Ayv2) VOl rts, 5 (4.5.5) 
Dy 

IXY, (Jpu, W) 2= J soldne.ayw) VOl fig, « (4.5.6) 
Dy) 


In order to define the elastica functional, we have to introduce the second fundamen- 
tal form of an immersion. Therefore, we make a short excursion to the Hessian of a 
function on a Riemannian manifold. 


Proposition 4.5.2. Let f € @,r € [1, e], andu € W(X; R™). Define the Hessian 
Hess,(u) of u with respect to f by 


Hess,(u)(X, Y) = (d(dudf")X)- (df Y), forall X, Y € Ta. 


4.4 Thecase of manifolds without boundary can be treated, e.g., by imposing the nonlinear barycen- 
ter constraint [' f vol, = 0. 

4.5 We point out that the analogous Banach bundles for treating the case 0’ = would not be mere 
products anymore. 
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One has Hess,(u) € L™"®-(5; Sym?(T 5; R™)) and Hess, is a continuous linear oper- 
ator® 


Hess/: W7"(;R™) > L™?@-9(5; Sym2(TE; R™). 


Proof. Let u € W"(X;R™) so that du is an element of W!"(©;Hom(T;R™)). The 
Moore-Penrose pseudoinverse restricted to linear maps of fixed rank is a smooth trans- 
formation. A concise treatment can be found, e.g., in [11] which allows us to deduce 


D(f + df") u = -df" dudf* + df'(du df") (idp -df df’). (4.5.7) 


Hence one has df’ ¢ W1?(x;Hom(R™; T)) and 4.5.3 below completes the proof. 


Lemma 4.5.3. Let E,, E2, and E3 be smooth Banach bundles over the compact, smooth 
manifold 3’, let uw: Ey xy E2 — E3 bea locally Lipschitz continuous bilinear bundle map 
and let p > dim(). 

Then pt o (01, 02) € wi-min®.)(5; E;) holds for all sections 0, € W'?(5; E,) and 
02 € W""(; E>). Moreover, the induced bilinear map 


A: W'?(53E1) x W?"(53 Ey) > Wemin®-(55 By) 
is continuous. 


Proof. It suffices to perform the regularity analysis locally. Thus, we may focus our 
attention to an open set U c 3) and we may assume that FE;|U ¥ UxR™,i € {1, 2,3} 
are trivial vector bundles. Moreover, we may write 0;(x) = (x, f1(x)), 020) = (x, fod), 
and px = By for allx € U with f; « W??(5;R™), fp © W'"(Z;R™), and Be 
w)(U; Bil(R™ x R™;R™)). 

The Sobolev embedding W1!?(x;R™) — L™°(2;R™) shows that 
Bi, f2) € L(2;R™). With n := dim(), one has the Sobolev embedding 
W?"(y;R™)  L'(y;R™) where 

[1,24], r<n, 
re 4[l,cof, r=n, 


[1,co], r>n. 
For each smooth vector field X on U, we obtain 


d(B(fi, f2)) X = (dB X)(f1, fo) + Bidfi X, fol) + Bf, df2 X) 


4.6 We point out for geometers that the section Hess,(u) coincides with the Hessian vi "godu, provided 
that f and u are sufficiently smooth. Here, VF's denotes the Levi-Civita connection of the Riemannian 
metric f* go 
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and this together with the Holder inequality implies d(B(/t, f2)) € 
L™ins.(5; Hom(T’ 5;R™)), hence B(fi,fo) € Wr™™G(y;R"), where s = 
G + +) +. We analyse the following three cases: 

Case 1.: r < n. Because of p > n > r, we have 


so that min(s, r) = r = min(p, r). 
Case 2.: r = n. One may write r = n = (1+ )"'p < p withsomee > 0. Choosing 
7 = 2 < co, we obtain 


This shows min(s, r) = r = min(y, r). 
Case 3.: r > n. Then one has 7 = co ands = p, leading directly to min(s, r) = min(y, r). 
The continuity of A follows from the already mentioned Hélder and Sobolev inequal- 
ities. 


Definition 4.5.4. The second fundamental form II(f) of f can be written as II(f) := 
Hess,(f) so that one has 


I(f) € LP(5; Bil(TS xs TE;R™)) c L7(5; Bil(TE x TY; R™)). 


We define the elastica functional ¥: C + R by 


1 
sf) = 5 / IH(F)| rg, Volpe, « 
>» 


Theorem 4.5.5. For p € [2, cof N |dim(2), co [, the elastica functional ¥ and the Will- 
more energy W are smooth Riesz functions on the smooth Riesz manifold (C, i, j, I, J) 
from (4.5.5) and (4.5.6). 


Proof. Fix f < Cand let u € TC c W?-P(y; IR™). From (4.5.7), we may deduce 
DI(f) u = (idigm -df df") Hessp(u) + (du df")’ H(f). 


Equation (4.5.1) provides us with a formula for the derivative of f ++ volys,,. More- 
over, we have ISlFeg, = |S(df" -, df" -)|Z, for S € L'(2; Bil(TY xs, TY; R™)). This would 
allow us to compute a precise expression for (dF|,, u), but it already suffices for our 
considerations to observe that d¥ is of the form 


(asp, u) = 5 / ((H(), Hess/(u)) 2g, + HCI), af, df", du)) voljrg, 
Dy 
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where p(II(f), df, df", du) is a polynomial expression in II(f), df, df", du with constant 
coefficients in which II(f) occurs with order two and du occurs with order one. 

Now let w € Yy = W*-4("; R™). Note that we have II(f) € L?, df €¢ L®, anddf* € 
L®™, hence p(I(f), df, df", -) € LP/2. In the case n = 1, we have dw € W!*4 <5 L®. For 
n> 2, we have dw € W'4 <> L’ withr = pt . Because of p > n, we obtain 


p p é 
2 > = 2), 
re Pg > P= iD) 


thus dw ¢ L®/2). In any case, we obtain p(I(f), df, dft,dw) ¢ L}. This shows 
that dF|, can be continuously extended to Cr: Yr — R so that jpipTF = Cji 
holds. Hence, F is a Riesz function. The statement for W follows from the identity 
Hy = ims) yo WCF) (df*e;, df*e;) for any go-orthonormal basis e1,...,@m of R™ 
and from the above discussion. 


4.5.3 Euler-Bernoulli Energy and Euler Elastica 


The elastica functional is well-known under different names in dimensions one and 
two. For dim() = 1, the functional F is identical to the Euler-Bernoulli bending energy 


1 2 
F(f) = 5) [lie Vol fig,» 
>» 


where x; is the curvature of the regular curve f: ’ + R™. Let / be a compact interval 
equipped with the Euclidean metric g, let y: 0 > R™, andv: oY > S™1 CR™ be 
given. 

The critical points of the Euler-Bernoulli bending energy F on the set 


{f EC | volre, =Volg, flags=, n(f) =v} (4.5.8) 


are called Euler elasticae (with clamped ends) where the outward unit normals are de- 
noted with n(f): 0 > S™-1. While f|ys: = ¥ is clearly an inhomogeneous Dirichlet 
boundary condition, one may call n(f) = v a Neumann-type boundary condition. Note 
that n(f) depends nonlinearly on f. The constraint vol;s,, = volg amounts to the re- 
quirement that all curves f in the feasible set shall be in arclength parameterization 
(with respect to the given metric g). 

With Figure 4.6 and Figure 4.7, we provide two numerical examples of discretized 
pseudogradient flows for the Euler-Bernoulli bending energy subject to arclength pa- 
rameterization constraints. We discretize the space C by polygonal lines. Since polyg- 
onal lines are almost never elements of W*’*("; R™) (unless they are straight lines), 
we have to discretize the energy F as well. Our choice is 


5(f) = ; (2 secant 2)" g, 
i=1 


Lj 
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where @; denotes the turning angle of the polygonal line f at vertex i (i.e., m minus the 
angle enclosed by the neighboring edges) and where £; is the average of the lengths of 
contiguous edges. As discrete pseudo-Riesz isomorphism, we use (4.5.5) and (4.5.6), 
where A; is replaced by the discrete Laplace-Beltrami operator. Strictly speaking, this 
is not justified by the smooth theory of pseudogradient flows as the discretization is 
nonconforming. However, pseudogradient search using this discretization performs 
surprisingly well as the figures attest. For comparison, we refer the reader to [3] and 
[4], where the L?-gradient flow of the Euler-Bernoulli energy is treated. 

We point out that the presented flows preserve the parameterization by arclength. 
This was achieved by 


— projecting the pseudogradient to the tangent space of the constraint manifold; 

— performing an Armijo line search on the osculating circle of the constraint mani- 
fold in pseudogradient direction; 

— and projecting the result onto the constraint manifold via a Newton-type algo- 
rithm involving a pseudoinverse of the linearized constraint map. 


Of course, each of these steps involves operations that are usually only available for 
Riemannian manifolds. The methods go through numerically as (i) the discretized 
pseudo-Riesz isomorphism actually induces a Riemannian structure on the discrete 
configurations space Cs and (ii) the Euler-Bernoulli energy can be formutated on the 
Riemannian manifold Imm??() ; R™). However, we believe that at least some of these 
operations can also be established for general Riesz manifolds. 


4.5.4 Willmore Energy 


As mentioned before, the case dim(2) = 1 is very special in that the elastica functional 
can be well-formulated on the Riemannian manifold Imm*?(; R"™). Therefore, we 
also consider the case dim() = 2. One has the relation |II(f)|? = xt + x3, Kp = Ky K2, 
and |H,|? = 7(x, +k)? for the the principle curvatures x, and K>, the Gauss curva- 
ture Ks, and the mean curvature vector Hy of the immersed surface f. This leads to the 
identity 


F(f) = Je |H,| = Ky) VOl f# g, . 
5) 
In the case that 0X’ = 0, the Gauss-Bonnet theorem and denseness considerations 


imply that 


[* volfig, = 2 1. x(E) (4.5.9) 
z 
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Fig. 4.6: Discrete elastic figure eight knot (2000 edges): (a) initial condition; (b)-(e) the first four 
iterations of pseudogradient flow with Armijo line search; (f) ultimate minimizer (the round circle), 
obtained after 6 pseudogradient steps in total and 3 Newton iterations. 
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Fig. 4.7: Three discrete elastic threads (6000 edges in total) joint together with clamped boundary 
conditions imposed at the free ends: (a) initial condition; (b)—(e) the first four iterations of pseudo- 
gradient flow with Armijo line search; (f) ultimate minimizer, obtained after 4 pseudogradient steps 
in total and 4 Newton iterations. 
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with the Euler characteristic of ©’. Hence, F is essentially identical to the Willmore 
energy 


1 
Wi) fuse volpig, « 
a 


Different scaling conventions for Hy and equation (4.5.9) lead to a confusing variety 
of definitions for the Willmore energy in the literature, another favorite variant be- 
ing [ |Ayl? VOlfig, = f (Ki - Ky)? vol,#,,. In general, all these energies differ by a 
total Gauss curvature expressions and by boundary integrals involving Dirichlet and 
Neumann boundary data and thus lead to essentially the same optimization problems 
when subject to both Dirichlet and Neumann-type boundary conditions. 

With Figure 4.8 and Figure 4.9, we provide two numerical examples for discretized 
pseudogradient flows for the Willmore functional subject to constrained total area. 
We use triangle meshes as discrete surfaces, so that the Willmore energy has also to 
be discretized (see [8] for details on the discretized Willmore energy that we used ). 

The numerical experiments we have conducted so far indicate that pseudogra- 
dient search provides a quite efficient and robust method for minimizing curvature 
dependent energies—at least if compared to the Willmore flow, the L*-gradient flow 
of the Willmore energy. The latter is a forth-order parabolic flow and it suffers even 
more severely from the time discretization issues discussed at the end of Section 4.5.1. 
More details on the Willmore flow along with numerical examples for comparison can 
be found, e.g., in [2] and [8]. 

Concerning the minimization of the Willmore energy of surfaces, we point out that 
there is a further very efficient method which involves an L*-gradient descent in “cur- 
vature space” (see [6]). However, this method is heavily based on the very special re- 
lationship between mean curvature and conformal geometry and it is not clear if and 
how this method can be carried over to other geometric energies. 


4.6 Final Remarks 


Another interesting feature of pseudogradient flows is that all points of a flow tra- 
jectory have regularity not below the initial condition, provided that the used Riesz 
structure is defined by an elliptic operator. This does not directly imply that critical 
points are arbitrarily smooth (since they will usually be obtained only as limit of the 
trajectory for tf — oo), but, maybe, this can exploited for regularity theory in cases 
where long time existence and a priori bounds for the behavior along the flow (e.g., in 
the spirit of Grénwall’s inequalities) can be provided. 

We plan to apply the presented techniques to knot energies such as the Mobius 
energy and integral Menger curvatures. Their L?-gradient flows (if existent) would be 
parabolic of fractional order somewhere between 2 and 4. The Mobius energy is best 
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Fig. 4.8: Discrete surface (~ 93k faces) relaxing under the pseudogradient flow of the Willmore 
energy subject to an equality constraint on total area: (a) initial condition; (b)—(e) iterations 6, 12, 
18, 24, and 30 of the flow. Initial model kindly provided to the public by Keenan Crane. 


106 —— Henrik Schumacher 


Fig. 4.9: Discrete surface (~ 42k faces) relaxing under the pseudogradient flow of the Willmore 
energy subject to an equality constraint on total area: (a) initial condition; (b)—(e) iterations 6, 12, 
18, 24, and 30 of the flow. Initial model kindly provided to the public by Keenan Crane. 
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described on the constraint manifold of an arclength parametrization constraint.*’ 
Moreover, integral Menger curvatures become arbitrarily small when scaling the knot 
with large factors (the same occurs for the Euler-Bernoulli energy), and constraints 
such as total length or parameterization by arclength have to be imposed in order to 
obtain a well-posed optimization problem. Thus, both energies require, in one way 
or the other, a nonlinear constraint for using smooth optimization techniques. We ex- 
pect that their analysis might benefit significantly from pseudogradient descent with 
respect to suitable (yet to be found) Riesz structures. 
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Sebastian Scholtes 
Discrete knot energies 


Abstract: The present chapter gives an overview on results for discrete knot ener- 
gies. These discrete energies are designed to make swift numerical computations and 
thus open the field to computational methods. Additionally, they provide an indepen- 
dent, geometrically pleasing and consistent discrete model that behaves similarly to 
the original model. We will focus on Mobius energy, integral Menger curvature and 
thickness. 


Keywords: Mobius energy, thickness, ropelength, ideal knot, Menger curvature, polyg- 
onal knot, knot energy, ’'-convergence, discrete energy, knots 


MSC: 49J45, 57M25, 49Q10, 53A04 


5.1 Introduction 


In classic knot theory, mathematicians are often interested in knot classes and how to 
distinguish between them, for example via knot invariants. Contrary to this approach, 
geometric knot theory deals with the specific shape of knots and how to find or com- 
pute particularly nice representatives of a given knot class. The exact meaning of nice 
depends on the context and can vary from applied (see the section on thickness) to 
theoretical considerations (see the section on integral Menger curvature). To capture 
the “quality ” of aknot, Fukuhara introduced the concept of a knot energy (see [27]). An 
optimal representative is then said to be a minimizer of this energy among all curves of 
a given knot class. Later on, this approach was further developed by other authors (see 
[76, 17, 88]) and by now, a functional on the space of knotted curves that is bounded 
from below and gets infinite as curves approach a self intersection is called a knot en- 
ergy (see [53]). This definition already includes some helpful ingredients to show that, 
indeed, minimizers of the energy exist. The Mdbius energy 


7 1 1 
” Gre: sata) 


for arc length curves ¥, is a particular example of such an energy (see [49]). In this 
chapter, we additionally consider integral Menger curvature and thickness. Each of 
these three energies is connected to curvature. For an overview of regularizing and 
knot theoretic properties of different curvature energies see [84, 86, 87, 11, 12]. 
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One way to think about knot invariants vs. knot energies is that the first one gives 
rough information about the shape of the knot, for example, by restricting the knot to 
belong to a certain knot class, while the energy resolves the finer details. For instance, 
an energy bound can in turn give bounds on the curvature, bi-Lipschitz constant, av- 
erage crossing number or stick number. 


The explicit shape of energy minimizing knots is only known in case of the unknot. 
For most energies, the energy optimal unknot is proven or expected to be the round 
circle (see [1]). Therefore, it is important to know how to make the energies accessible 
to computers and approximate such minimizers. One approach is to investigate so- 
called discrete knot energies, that is energies which are defined on polygonal knots. 
Here, the goal is to minimize these discrete energies in the class of polygonal knots 
of a fixed knot class and fixed number of vertices and then prove that these minimiz- 
ers converge to the minimizer of a “smooth” knot energy within the same knot class. 
When trying to find such a discrete energy, the most obvious approach would be to re- 
strict the original energy to the space of polygonal knots. However, such an approach 
does not work, as polygons have infinite energy. This is related to the fact that all three 
energies are regularising, i.e. curves of finite energy are more regular than the curves 
for which the energy is well-defined. By borrowing appropriate concepts of curvature 
from discrete geometry and replacing smooth notions by discrete ones (for example 
integrals by sums), it is possible to define discrete energies in the same spirit as the 
original energies. Hence, these energies are not merely discretisations but discrete ver- 
sions of the original energy. A suitable convergence of energies, which to some extent 
already includes the convergence of minimizers, is !-convergence. 


For each of the three energies, there is a section in which we first consider the 
history of the energies and comment on recent developments. Then we introduce the 
appropriate discrete energies and explain the connections to the original energies. For 
the sake of clarity, we present most results for curves of length 1. In the appendix, we 
give a short introduction to I-convergence. 


5.1.1 Notation 


In the following sections, C is the space of closed arc length curves with length one and 
Pn the subspace of equilateral polygons with n segments. Furthermore, we abbreviate 
Cip = EN (C ky U, EN Pn), where C K is the class of k-times continuously differentiable 
functions. We write C: for the class of functions in C whose derivatives are Holder 
continuous with exponent a and L4 for the Lebesgue spaces of qg-integrable functions. 
By W*4 we denote the standard Sobolev spaces of k-times weakly differentiable closed 
curves with q-integrable weak derivative. Adding a knot class « in brackets to a set 
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of curves restricts this set to the subset of curves that belong to the knot class K. The 
circle of length L > 0 is denoted by S;. 


5.2 Mobius Energy 


The Mobius energy 


1 1 
oie Greer tp) 8 


is defined on closed rectifiable arc length curves ¥ of length L. Here, ds, is the intrinsic 
metric on S;. This energy was introduced by O’Hara (see [49]) and has the interesting 
property that it is invariant under Mobius transformations, hence its name. O’Hara 
could show that finite energy prevents the curve from having self intersections. Later 
on, the existence of energy minimizers in prime knot classes was proven by Freedman, 
He and Wang (see [26]), while there is a folklore conjecture, usually attributed to Kus- 
ner and Sullivan, questioning the existence in composite knot classes based on com- 
puter experiments (see [41]). Additionally, it was shown that the unique absolute min- 
imizer is the round circle (see [26]). Similar uniqueness results are known for broader 
classes of energies (see [1]). The regularity of minimizers and, more generally, of criti- 
cal points was investigated and smoothness could be proven (see [26, 32, 63, 64, 14]). 
Furthermore, it was shown that the Mobius energy of a curve is finite if and only if the 
curve is simple and the arc length parametrisation belongs to the fractional Sobolev 
space W3/2;2 (see [4]). The gradient flow of the Mébius energy was investigated (see 
[5, 8]) and results for the larger class of O’Hara’s knot energies (see [51, 50, 52]) are 
available (see [9, 65, 10, 7]). 


A discrete version of the Mébius energy, called minimum distance energy, was in- 
troduced by Simon (see [74]). If p is a polygon with n consecutive segments X; this 
energy is defined by 


ema,n(P) = Uma.n) _ Uma,n(En) (5.2.1) 

with 

n n 

[Xi] X)| 
U i= o> 
uO ama 
= = 
X;,X; tok adjacent 

where gy is the regular n-gon. Note, that this energy is scale invariant. Similar energy 
functionals for polygons were considered previously (see [27, 16]). There is also a vari- 
ant of the Mébius energy for graphs (see [36]) and for curves with self intersections 
(see [24]). It is know that the minimum distance energy is Lipschitz continuous on 
sublevel sets: 
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Theorem 5.2.1 (Minimum distance energy is Lipschitz continuous, [75]).. The mini- 
mum distance energy Ema,, is continuous on the space of nonsingular polygons with 
n segments. Moreover, it is Lipschitz continuous on the subspace of polygons of length 
at least £, whose energy is bounded by a constant E. 


Furthermore, the existence of minimizers for every tame knot class was established: 


Theorem 5.2.2 (Existence of discrete minimizers for Eman, [75]). For every tame knot 
class XK there is a minimizer of Ema,n in the set of all polygons with n vertices that belong 
to K. 


For this energy, approximation results for suitably inscribed polygons could be shown: 
Theorem 5.2.3 (Convergence for inscribed polygons, [61]). Let y € CM C? and let pn 


be inscribed polygons that divide y in n arcs of length i. Then 


Cy 


|EQ) 7 Ema,n(Pn)| <7: 
ni 


If nis large enough, the constant C(y) can be chosen as 
290 

C(y) = —, 
Alylé 


where A[7] is the thickness of y (see Section 5.4). Hence, 
Ema,nn) — E(y). 


Furthermore, an explicit error bound on the difference between the minimum distance 
energy of an equilateral polygonal knot and the Mobius energy of a smooth knot, ap- 
propriately inscribed in the polygonal knot, could be established in terms of thickness 
and the number of segments: 


Theorem 5.2.4 (Energy bound for inscribed knots, [62]). Let p be an equilateral poly- 
gon of length 1. Then there is an “inscribed” C? knot yp such that 


lEma.n(P) - ECop)| 2 ate a ou i 2 x a ee cst) 


and the constants C;(p) depend in an explicit way on negative powers of Ay[p]. 


However, from these results it is not possible to infer that the minimal minimum dis- 
tance energy converges to the minimal Mobius energy in a fixed knot class. For the 
overall minimizers of the minimum distance energy the following result is known: 


Theorem 5.2.5 (Minimizers of md energy, [89, 78, 79]). The minimizers of the mini- 
mum distance energy Ema,n are convex and forn € {4,5} these minimizers are the 
regular n-gon. 
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This evidence supports the conjecture that the regular n-gon minimizes the minimum 
distance energy in the class of n-gons. Numerical experiments regarding the minimum 
distance energy under the elastic flow were carried out (see [33]). 


Another, more obvious, discrete version of the Mébius energy was used for numer- 
ical experiments (see [39]). This energy, defined on the class of arc length parametri- 
sations of polygons of length L with n segments, is given by 


n 1 ; 
En(p) := ) d(a;41, a;)d(aj41, a), (5.2.2) 

(Ga rae zap) Gis1, A) ALAjy1, Gj 

ifj 


where the a; are consecutive points on S; and p(a;) the vertices of the polygon. Also 
this energy is scale invariant and it is easily seen to be continuous on the space of 
nonsingular polygons with n segments. The relationship between this discrete Mébius 
energy and the classic Mébius energy could be established in terms of a -convergence 
result: 


Theorem 5.2.6 (Mobius energy is I’-limit of discrete Mobius energies, [68]). For q € 
[1, co], ll-l] © {Il-llzac, ea)> Ill w2.a(g, spay } @nd every tame knot class K holds 


En > Eon (C1,p(H0), [lll) 


Since we already know that minimizers of € in prime knot classes exist and are 
smooth, Theorem A.1.2 implies the convergence of discrete almost minimizers: 


Corollary 5.2.7 (Convergence of discrete almost minimizers, [68]). Let K be a tame 
prime knot class, pn € Pn(K) with 


ae En - En(pn)| > O and Pn>y€ CK) inL'(Si, R%). 
Then ¥ is a minimizer of € in C(K) and Jim En(pn) = E(9). 


The result remains true for subsequences, where the number of edges is allowed 
to increase by more than one for two consecutive polygons. Since all curves are 
parametrised by arc length, it is not hard to find a subsequence of the almost mini- 
mizers that converges in C°, but generally this does not guarantee that the limit curve 
belongs to the same knot class or is parametrised by arc length. For polygons inscribed 
ina C'1 curve, there is an estimate on the order of convergence: 


Proposition 5.2.8 (Order of convergence for Mobius energy, [68]). Let + € 
c'1(S,, R®) be parametrised by arc length and c,é > 0. Then for every € € (0,1) 
there is a constant Cz > 0 such that 


[EG) - En(pn)| < 
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for every inscribed polygon pn given by a subdivision b;, k =1,...,n of Sz such that 


Cc : Cc 
5 Simin, |1Bxoa) ~ a(p)| max (yar) —r1bp)| <5 
This is in accordance with the data from computer experiments, which suggests that 
the order of convergence should be roughly 1 (see [39]). If no regularity is assumed, 
the order of convergence might not be under control, but still, the energies converge: 


Corollary 5.2.9 (Convergence of Mobius energies of inscribed polygons, [68]). Let 
y € C with E(y) < cv and pn as in Proposition 5.2.8. Then limn— oo En(pn) = E(y). 


In contrast to the situation for the minimum distance energy, the overall minimizers 
of the discrete Mébius energy are known: 


Lemma 5.2.10 (Minimizers of discrete Mobius energy, [68]). The unique minimizer of 
En in Pr is the regular n-gon. 


An immediate consequence is the convergence of overall discrete minimizers to the 
round circle: 


Corollary 5.2.11 (Convergence of minimizers to the round circle, [68]). Let pn € Pn 
bounded in L® with En(pn) = inf, En. Then there is a subsequence with pn, — yin 
w'(S,, R%), where y is a round unit circle. 


One of the main differences between the discrete Mobius energy (5.2.2) and the mini- 
mum distance energy (5.2.1) is that bounded minimum distance energy avoids double 
point singularities, while for (5.2.2) this is only true in the limit. This avoidance of sin- 
gularities permits to prove the existence of minimizers of the minimum distance en- 
ergy (5.2.1) via the direct method. This might be harder or even impossible to achieve 
for the energy (5.2.2). Nevertheless, the relation between the discrete Mobius energy 
(5.2.2) and the smooth Mobius energy is more clearly visible than for the minimum 
distance energy (5.2.1), as reflected in Theorem 5.2.6 and Corollaries 5.2.7-5.2.11. 


5.3 Integral Menger Curvature 


The integral Menger curvature was first considered by Mel’nikov (see [47]) as a concept 
of curvature of a measure that is naturally connected with the Cauchy transform of this 
measure. For closed arc length curves + of length L, the integral Menger curvature is 
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given by 


Ms(q) #= / i / x8(4(t), (uw), 1(v) dt dudy, 


Sp Sp Sr 


where s € (0, co) and x(x, y, z) is the inverse of the circumradius r(x, y, z) of the three 
points x, y and z. The integral Menger curvature for s = 2 played an important role 
in the solution of the Painlevé problem, i.e. to find geometric characterisations of re- 
movable sets for bounded analytic functions, see [54, 23, 90] for a detailed presenta- 
tion and references. There is aremarkable theorem, which states that one-dimensional 
Borel sets in R@ with finite integral Menger curvature MM) are 1-rectifiable (see [42]). 
These results for (2 were later extended to sets of fractional dimension and metric 
spaces (see [43, 31]). As a consequence, this theorem also ensures that an H 1 measur- 
able set E c R¢ with ,(E) < co has approximate 1-tangents at 1! a.e point. 


Complementary to this research, where highly irregular sets are permitted, is the 
investigation of rectifiable curves with finite {; energy. These curves have a classic 
tangent 4 a.e. to begin with. It turns out that for s > 3 this guarantees that the curve 
is simple and that the arc length parametrisation is of class C1.1-3/?, which can be in- 
terpreted as a geometric Morrey-Sobolev imbedding (see [83]). It could be shown that 
the space of curves with finite ; for s > 3 is that of Sobolev-Slobodeckij embeddings 
of class W2-2/s)5 (see [6]) and that polygons have finite integral Menger curvature Ms 
exactly for s € (0, 3) (see [66]). Furthermore, results regarding optimal Hélder regular- 
ity could be obtained (see [40]). Related energies have been investigated with regard 
to their regularizing properties (see [85, 82, 13]). 


The movement of quadrilaterals according to their Menger curvature was inves- 
tigated (see [35]) and computer experiments for the gradient flow of integral Menger 
curvature were carried out (see [34]). Besides the ad hoc method used there, the only 
theoretical results regarding discrete versions of the integral Menger curvature that 
we are aware of can be found in the thesis of the author (see [70]). These results are 
still subject to ongoing research and are soon to be extended in an article of the author 
(see [71]). There, the discrete integral Menger curvature of a polygon p with consecutive 
vertices p(a;) = x; is defined by 


n 


x1 -Xpal + |X. —X 
MonD= S> KOaxsxd T] x1 mle a XI 
i,j,k=1 le {i,j,k} 
#{L I,K} =3 


if x; # x; fori # jand Ms,n(p) = else. It is easily seen, that this energy is continuous 
on the space of nonsingular polygons with n segments. As for the Mobius energy, there 
is a IT convergence result: 
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Theorem 5.3.1 (Menger curvature is I’-limit of discrete energies, [70]). For q E 
[1, oo], ll-I] © {ll-llnac, ays I'll w2.a(g, zea) } 4nd every tame knot class K holds 


Ms,n > Ms on (Co,p(K), |I*))- 


Additionally, it could be shown that the energies of inscribed polygons converge to the 
energy of the curve if the number of vertices increases: 


Corollary 5.3.2 (Convergence for inscribed polygons, [70]). Lets € (0,0), 7 € 
@ m C* embedded and py be inscribed equilateral polygons with n segments. Then 
limn—.o Mes,n(pn) = Ms(7). 


5.4 Thickness 


The thickness Aly] of a curve y was introduced by Gonzalez and Maddocks (see [29]) 
as 
Aly] := eee r(y(s), y(t), yu) 

and is equivalent to Federer’s reach (see [25]). As in the previous section, r(x, y, Z) 
is the circumradius of the three points x, y and z. Geometrically, the thickness of a 
curve gives the radius of the largest uniform tubular neighbourhood about the curve 
that does not intersect itself. The ropelength, which is length divided by thickness, is 
scale invariant and a knot is called ideal if it minimizes ropelength in a fixed knot 
class or, equivalently, minimizes this energy amongst all curves in this knot class 
with fixed length. Ideal knots are of great interest, not only to mathematicians but 
also to biologists, chemists and physicists, since they exhibit interesting physical fea- 
tures and resemble the time-averaged shapes of knotted DNA molecules in solution 
(see [80, 37, 38] and [81, 77] for an overview of physical knot theory with applica- 
tions). The existence of ideal knots in every knot class was settled by different teams 
of authors (see [19, 30, 28]) and it was found that the unique absolute minimizer is 
the round circle. Furthermore, this energy is self-repulsive, meaning that finite en- 
ergy prevents the curve from having self intersections. By now it is well-known that 
thick curves, or in general manifolds of positive reach, are of class C!! and vice versa 
(see [45, 25, 72, 46, 67]). It was shown that ideal links must not be of class C* (see 
[19]) and computer experiments suggest that C’! regularity is optimal for knots, too 
(see [88]). Further computer experiments were carried out with the software packages 
SONO, libbiarc and ridgerunner (see [55, 20, 2]). A previous conjecture (see [19, Con- 
jecture 24]) that ropelength minimizers are piecewise analytic seems to be reversed by 
numerical results, which indicate that there might be more singularities than previ- 
ously expected (see [3, 56]). Further interesting properties of critical points as well as 
the Euler-Lagrange equation were investigated (see [72, 73, 18]). 
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Another way to write the thickness of a thick arc length curve is 
Aly] = min { minRad(y), las desd(,)} (5.4.1) 


(see [44, Theorem 1]). The minimal radius of curvature minRad(¥y) of + is the inverse 
of the maximal curvature 


maxCurv(y) := ||x||z and dcsd(y) := min 
(x y)Edcrit(y) 


ly — x| 
is the doubly critical self distance. The set of doubly critical points dcrit(y) of a C! 


curve ¥ consists of all pairs (x, y) where x = y(t) and y = 4(s) are distinct points on > 
so that 


(v0), vd) - (3) = (8), 10 - os) = 0, 


ie. s is critical for u |v(t) = y(u)|? and t for v1 |>(v) - o(s)|°. 


The discrete thickness An, derived from the representation in (5.4.1), was intro- 
duced by Rawdon (see [57]). The curvature of a polygon, localized at a vertex y, is 
defined by 


2 tan($) 


eal ¥2) = Teateyl’ 
2 


where x and z are the vertices adjacent to y and = <(y- x, z—y) is the exterior angle 
at y. We then set 
minRad(p) := maxCurv(p)"? :=_ min Ka (Xs Mis Xa) 
i=1,..., n 

if the polygon p has the consecutive vertices x;, Xo := Xn, Xn+1 = X1. The doubly critical 
self distance of a polygon p is given as for asmooth curve if we define dcrit(p) to consist 
of pairs (x, y) where x = p(t) and y = p(s) and s locally extremizes u > | p(t) - p(u)|* 
and t locally extremizes v > | p(y) - p(s)|’. Now, An is defined analogous to (5.4.1) by 


An|p] = min { minRad(p), 21 desd(p) } 


if all vertices are distinct and An[p] = 0 if two vertices of p coincide. In a series of 
works (see [57, 58, 59, 60, 48]) alternative representations and properties of the dis- 
crete thickness were established. For example, it was shown that the discrete thick- 
ness is continuous: 


Theorem 5.4.1 (Discrete thickness is continuous, [57]). The discrete thickness An is 
continuous on the space of simple closed polygons with n-segments with regard to the 
metric 


n 
d(p, gq) := So\xi-yil, 
i=1 


where p and q are polygons with vertices x; and yj. 
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Another important result is the existence of discrete minimizers: 


Theorem 5.4.2 (Existence of discrete minimizers for An, [60]). For every tame knot 
class K there is an ideal polygonal knot in Pn(K). 


More generally, the result remains true if instead of equilateral polygons one takes the 
space of polygons with a uniform bound on longest to shortest segment length. These 
results were then used to prove the convergence of ideal polygonal to smooth ideal 
knots, a result that could later be improved from C° convergence to C°:! convergence. 


Corollary 5.4.3 (Convergence of ideal polygonal knots, [60, 69]). Let K be a tame 
knot class and pn € Pn(K) bounded in L™ with |inf.p, (4) An’ - An[pn]'| — 0. Then, 
there is a subsequence 

w*™(S:,R’) 


Pn, ——"-> y€ C(K) ~—s with = A* [>] = inf Av’ = lim Ajy[pn,]. 
k—-00 e(K) k-00 


The relationship on a functional level is again captured by a ’-convergence result: 


Theorem 5.4.4 (Inverse thickness is I’-limit of discrete energies, [69]). For every 
tame knot class K holds 


a7 - 
An’ + A* on(€(K), || llwi=(,,R3)) 


Similar questions for more general energies were considered (see [21, 60]). If the knot 
class is not fixed, the unique absolute minimizers of A; is the regular n-gon: 


Proposition 5.4.5 (Minimizers of inverse discrete thickness, [69]). The unique mini- 
mizer of Aj! in Pn is the regular n-gon. 


A.1 Appendix: Postlude in [-convergence 


In this section, we repeat some relevant facts on ’-convergence. For more details, we 
refer to the books by Dal Maso and Braides (see [22, 15]). This notion of convergence 
for functionals was introduced by DeGiorgi and is devised in a way, as to allow the 
convergence of minimizers and even almost minimizers. 


Definition A.1.1 (I-convergence). Let X be a topological space, ¥,Fn : X > R i= 
RU {£00}. Then Fn I'-converges to F, if 

— for every Xn > x holds F(x) s liminfn... Fn(xn), 

— for every x € X there are xn — x with limsup,,_,., Fn(%n) < F(X). 
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The first inequality is usually called lim inf inequality and the second one lim sup in- 
equality. If the functionals are only defined on subspaces Y and Yn of X and we extend 
the functionals by plus infinity on the rest of X, it is enough to show that the lim inf 
inequality holds for every xn € Yn, x € X and the limsup inequality for x €« Y and 
Xn € Yn to establish I’-convergence. We want to use I’-convergence to ensure that mini- 
mizers of the discrete functional Fn converge to minimizers of the “smooth” functional 
Fs 


Theorem A.1.2 (Convergence of minimizers, [22, Corollary 7.17, p.78]). Let Fn, F 
X + Rwith Sn 4 F. Let en > 0, €n + Oand xn € X with inf Fn — Fn(xn)| < En. 
If Xn, — x, then 


F(x) = inf F = lim Fn, (xn,). 
k-00 


Note, that the previous theorem does not imply convergence of a subsequence, but 
instead assumes that we already have such a sequence to begin with. This fact is often 
taken care of by an accompanying compactness result. 


We prove that the integral Menger curvature converges to the inverse thickness, 
to acquaint the reader with a particularly simple case of such a convergence theorem: 


Lemma A.1.3 (Convergence of integral Menger curvature to inverse thickness). The 
integral Menger curvature converges to inverse thickness 


ae er 
M$ 4A on (C(X), |I+ll~)- 


Let K be a tame knot class and let s € (3, e°). Then, there are minimizers ys of Ms in 
(XK). Moreover, we have that 


inf wi/s ——+ inf a7} 
e(K) S>eo C(K) 


and, after translating the minimizers if necessary, there is subsequence such that 
ct 
Ys, ——> 7 € C(K), 
k-00 
where + is an ideal knot, i.e. 


Aly)* = inf At. 
e(K) 


Proof. Using the Holder inequality, it is easy to see the monotonically increasing con- 
vergence 


MG (y) = |KO. 


Ls([0,1}3) s—y00 KO, Ys WDIlt=co.113) = \ 
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An application of Fatou’s Theorem shows that the functionals are lower semi- 
continuous with regard to uniform convergence. Hence, we immediately have I con- 
vergence (see [15, Remark 1.40 (ii), p.35]). For s > 3, curves of finite energy are of 
class C! (see [83]), so that we can restrict to this topology. Minimizers of the energies 
are known to exist and the monotonicity from above gives a uniform bound on, say, 
M, (4s). These facts together with the L® bound imply C subconvergence to a simple 
arc length curve of the same length which belongs to the same knot class (see [83]). 
Now, the result is a consequence of Theorem A.1.2. 
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Khovanov homology and torsion 


Abstract: This article offers an introduction to Khovanov homology with a brief 
overview of the developments in low dimensional topology it has inspired, and rela- 
tions with other link homology theories. The emphasis is placed on the role of torsion 
in Khovanov homology and several open problems. 
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6.1 Introduction 


Categorification is a powerful approach to solving problems in many different areas of 
mathematics. The core idea is that classical objects endowed with some integral struc- 
ture (integers, polynomials with integer coefficients, vector spaces, etc.) can some- 
times be viewed as shadows of new, algebraically richer objects [1, 23]. These newly 
constructed objects, which we refer to as categorifications, should contain more infor- 
mation than the ones we started with. Often categorifications are interesting on their 
own, leading to beautiful and structurally deep mathematics. 

Since the discovery of the Jones polynomial in 1984 [18], the theory of quantum 
invariants for knots, links and 3-manifolds has been developed rapidly, first through 
many generalizations such as the HOMFLY-PT polynomial, then via representation- 
theoretical interpretations and Reshetikhin-Turaev invariants. At the end of the twen- 
tieth century, M. Khovanov [20] constructed a new kind of link invariant: a homology 
theory which lifts the properties of the Jones polynomial and carries an additional rich 
algebraic structure. 

Khovanov homology is a stronger knot invariant than the Jones polynomial, better 
at distinguishing knots than the Jones polynomial it categorifies. It turns out that the 
Khovanov homology of an alternating knot is determined by the Jones polynomial 
and the signature of the knot. Khovanov homology contains more information than 
the Jones polynomial when it comes to non-alternating knots. 

In 2010 P. Kronheimer and T. Mrowka proved that Khovanov homology distin- 
guishes the unknot [27], that is, any knot K whose Khovanov homology is trivial is 
an unknot. The corresponding question for the Jones polynomial remains open. Al- 
though the Jones polynomial is known not to distinguish the 2-component unlink, 
Khovanov homology does [13]. 
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Among the first of many notable applications of Khovanov homology we empha- 
size J. Rasmussen’s [37] concordance invariant, called the s-invariant, that enabled 
him to give a purely combinatorial proof of the Milnor conjecture, also known as the 
Kronheimer-Mrowka theorem. 


Theorem 6.1.1 (J. Rasmussen). The s-invariant provides a lower bound on the slice 
genus of a knot. 


Theorem 6.1.2 (Milnor conjecture). The slice genus of the T(p, q) torus knot is equal 
to 5(p - 1)(q - 1). 


Using the same s-invariant A. Shumakovitch improved the slice-Bennequin inequality 
for some classes of knots [40]. 

Khovanov’s construction led to many significant advances and new discoveries. 
There are several extensions of Khovanov homology, including odd and reduced Kho- 
vanov homology, as well as symplectic and bordered Khovanov homology which have 
been studied since. Other knot polynomials such as HOMFLYPT, Kauffman 2-variable, 
and sI(N) link polynomial have also been categorified. Motivated by the Floer homol- 
ogy, Ozsvath, Szab6, and Rasmussen constructed knot Floer homology that categori- 
fies the Alexander polynomial. Soon after, this categorification was given a purely 
combinatorial description by Manolescu, Ozsvath and Sarkar [30]. This abundance 
of new structures has provided many answers to classical questions in 3- and 4- 
dimensional topology. 


6.2 Definition and structure of Khovanov link 
homology 


Khovanov link homology is a functor that assigns to every link L bigraded homol- 
ogy groups Kh(L) whose Euler characteristic is the Jones polynomial of L, and to link 
cobordisms homomorphisms of homology groups. Here we define Khovanov homol- 
ogy in broad strokes, mostly relying on Bar Natan’s cube construction [3]. For details 
check one of the following excellent resources [1, 3, 44, 45, 43], each of which empha- 
sizes different aspects of Khovanov link homology. 

This construction is based on the Kauffman state sum expression for the Jones 
polynomial, known as the Kauffman bracket polynomial [19]. This formula can be used 
instead of the skein relation to express the Jones polynomial as a sum of powers of 
binomials (q + ae which is the Jones polynomial of the unknot. 

Let D be an n-crossing diagram ofa knot K. A Kauffman state s of the knot diagram 
D is a function from the set of crossings {C;}!., of D to the set {0, 1}. Diagrammati- 
cally, we assign to each crossing of D a marker according to the convention in Figure 


Khovanov homology andtorsion —— 127 


IC K+ 


Fig. 6.1: The smoothings of a crossing and the corresponding arcs in the Kauffman state circles. 


6.1, where each crossing is replaced by two parallel arcs. The Kauffman state can be 
viewed as a collection of circles D; embedded in the plane obtained by smoothing 
each crossing in D. Let |Ds| be the number of circles in Ds. 

There are 2” Kauffman states that can be conveniently organized as the vertices of 
an n-dimensional unit cube J" = [0, 1]". Each vertex V € I” of the cube corresponds 
to one Kauffman state s: the state that associates a 0 or 1 smoothing to a crossing C; 
depending on whether the i-th coordinate of that vertex is 0 or 1. 


Fig. 6.2: Cube of resolutions in Khovanov homology of a Hopf link. 


Let A, = Z[x]/(x* = 0) be a graded module with 1 in bidegree (0, 1) and x in 
bidegree (0, -1). A» has the Frobenius algebra structure including the multiplication 
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and comultiplication maps: 


m:A7z@A2 > Ar 1@1»1 
1@x,x@1lrwx 
X@XH0 

A:Az > Ar ® Ad 11@x+xe@l 


Xr>X@X 


To each vertex assign a module C(D;) = C(D(V)) = ASPs [h(s)]{q(s)} with ap- 
propriate homological [h(s)] and height (quantum) {q(s)} shifts [3]. Next, let e be 
the edge of the cube J” between vertices V and W whose coordinates are the same 
except in one place where W has entry 1 and V has 0. We define the per edge map 
de : C(D(V)) — C(D(W)) for each such edge e in the following way: 


— ifthe number of Kauffman circles in the Kauffman state of vertex W is smaller than 
the number of circles in the state associated to V then de is a multiplication on the 
corresponding copies of Az 

— otherwise d, is a comultiplication A on the respective tensor factors and all other 
tensor factors are mapped by the identity map. 


The structure we have described is referred to as the cube of resolutions and the 
Khovanov chain complex is its total complex, Figure 6.2. The Khovanov chain groups 
KhC'(D) are obtained by taking direct sums of modules along the hyperplanes orthog- 
onal to the main diagonal and defining the differential d ?s KhC(D) > KhC(D) to 
be a signed sum of edge maps de connecting vertices in I” with i ones in their Kauff- 
man states to the ones with i + 1 ones that differ only in one value of the Kauffman 
state. The homology of the Khovanov chain complex for a link L is called Khovanov 
link homology and denoted by Kh(Z). 


Theorem 6.2.1. Khovanov homology Kh(L) of a link L is independent of the choice of 
diagram i.e. Khovanov homology is a link invariant. 


Kh(L) recovers information about L known from the Jones polynomial, for example: 


— Thespan of the Jones polynomial of L gives a lower bound on the crossing number 
of L. The span of Kh(L), the difference between the highest and the lowest non- 
trivial quantum grading, gives a lower bound on the crossing number as well. 

— Not only does the Euler characteristic of the Khovanov homology equal the Jones 
polynomial; in fact, there is a long exact sequence in Khovanov homology which 
lifts the skein relation which defines the Jones polynomial. 


Khovanov homology is indeed much richer in structure than the Jones polyno- 
mial. In particular, it carries the following additional structures: 
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— torsion, applications of which are discussed in the remainder of this article; 

— the Lee and other spectral sequences; 

— Steenrod squares and other higher homological operations by Lipshitz-Sarkar, 
Kriz-Kriz-Po; 

— due to the functorial nature of Khovanov homology, invariants of cobordisms ex- 
ploited by M. Jacobson, D. Bar Natan, M. Khovanov, and Clark-Morrison-Walker. 


Khovanov homology also appears in mathematical physics in the work of Gukov, 
Schwarz, Vafa, and Witten, where it admits a description in terms of so-called BPS 
states. 


6.3 Torsion of Khovanov link homology 


Recall that the Jones polynomial equals the Euler characteristic of Khovanov homol- 
ogy, so the Jones polynomial depends only on the free part of Khovanov homology. 
This section is devoted to results concerning torsion of Khovanov homology inter- 
twined with the ideas and methods that could be useful in advancing our understand- 
ing of torsion and the additional information it may carry about knots and their cobor- 
disms. Although computations hint at the abundance of torsion, describing and un- 
derstanding torsion in Khovanov homology is still elusive. A great deal has been writ- 
ten about torsion of order 2; however, torsion of higher order is still a mystery [3, 5, 11]. 
Torsion in Khovanov homology is understood only for some classes of knots and links 
[2, 34, 36, 41, 42]. 


Conjecture 6.3.1 (A. Shumakovitch). The Khovanov homology of every link except the 
unknot, the Hopf link, their disjoint unions and connect sums, has torsion of order 2. 


As another illustration of the importance of torsion in Khovanov homology, note 
that the affirmative resolution of Shumakovitch’s conjecture would provide a new 
proof that the Khovanov homology is an unknot-detector. The only currently known 
proof of this fact due to P. Kronhaimer and T. Mrowka [27] relies on the relations with 
the instanton Floer homology. 


Goal1. Characterize and compute torsion in Khovanov homology. 


Approach 1. Spectral sequences for constructing odd torsion in link homology 

A first step to understanding the torsion in Khovanov homology is to construct links 
whose Khovanov homology has torsion of a prescribed order. A. Shumakovitch re- 
cently used the Bockstein spectral sequence to compute all of the torsion in the Kho- 
vanov homology of alternating knots. In particular, he proves Conjecture 6.3.1 for al- 
ternating knots and shows that Khovanov homology of thin knots has only Z>-torsion. 
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Fig. 6.3: Khovanov homology of torus knot 7(6, 5). Integer entry represents the rank of the group; 
integer with subscript stands for torsion of that order. For example, Kh*?35(T(6, 5)) = Zz @ Zs. 


Example 1. Examples of knots with Z;-torsion in their Khovanov homology in- 
clude the 5-strand torus knots: T(6,5), T(7,5), T(8,5) and T(9,5) but in gen- 
eral appear to be quite rare, see Figure 6.3. Based on the results in [36], J. Przy- 
tycki and the author predicted in 2012 that the Khovanov link homology of 
the positive adequate 36-crossing knot K given by the closure of the braid 
$183S7S3S1$3 8584538 1$3S7535}$383S,85 Will contain Z; torsion. This conjecture was 
confirmed by A. Shumakovitch using JavaKh [5]. 

D. Bar-Natan showed that Khovanov homology of the T(8, 7) torus knot contains 
Z7, Zs, Zy4, and Z-torsion. This 48 crossing knot reaches the limits of current com- 
putational resources [26, 6]. Extending theoretical results to non-alternating links in 
order to find Zy-torsion for odd primes p will require either a more sophisticated spec- 
tral sequence or perhaps arguments of a different nature altogether. 


Approach 2. Relations between Khovanov link, Hochschild algebra and chro- 
matic graph homology theories 

Hochschild homology is a well-understood cyclic homology theory of associative 
algebras. Przytycki observed, somewhat surprisingly, that Khovanov homology and 
Hochschild homology share a common structure [35]. The limit of the Khovanov ho- 
mology of T(2, n) torus links when n is large can be interpreted as the Hochschild 
homology of the algebra A, = Z[x]/(x* = 0) and some of the torsion of Hochschild 
homology can be seen in Khovanov homology of T(2, n). 

The cyclic nature of Hochschild homology can be interpreted pictorially. The n-th 
chain group can be pictured as an n-cycle. The differential corresponds to contracting 
the edges of n-cycle one by one to get an (n - 1)-cycle and multiplying the elements in 
the copies of A labelling the endpoints of the contracted edge. 


Khovanov homology and torsion —— 131 


For every planar diagram D of a link L there exists an associated planar graph 
I(D) obtained via the checkerboard black and white coloring of regions of D. Vertices 
of I'D) are black regions of D and edges correspond to crossings of D. We generalize 
this construction in the following way. 

Let D be a knot diagram. The Kauffman state graph [(D,)/I'(Dg) contains a ver- 
tex for each circle in the all-A/all-B Kauffman state D,/D , and an edge if two circles 
in D,/Dgz have arcs obtained by smoothing a single crossing in D. Checkerboard col- 
orings of an alternating knot diagram correspond to '(D,) and I'(D,). 

The graph I'(D,) corresponding to the T(2, n) torus link and Kauffman state send- 
ing all crossings to 1 is just an n-cycle. The chromatic graph homology HGR 4, (I) is 
one of several categorifications of polynomial graph invariants, and one of two cate- 
gorifications of the chromatic polynomial [10, 15]. Given a graph I the chromatic graph 
polynomial P(T) satisfies the recursive deletion-contraction relation 


P(T) = P( - e) - P(I/e), (6.3.1) 


where I - e is the graph obtained by deleting an edge e froma graph I, and ’/e is the 
graph obtained by contracting the same edge. After categorification, this relation lifts 
to long exact sequence of chromatic graph cohomology groups [15]: 


... > HGR',, (0) > HGRiy, (0 - e) > HGR',,(C/e) > HGRA) >... (6.3.2) 


The construction of HGR ,, (I) follows that of Khovanov: a bigraded homology theory 
H GR} (I) is associated to a graph Fr and a commutative (graded) algebra A», in such 
a way that its (graded) Euler characteristic is the value of the chromatic polynomial at 
the graded dimension of the algebra A. 

In terms of the chromatic graph homology, the relation between Khovanov and 
Hochschild homology can be expressed by saying that the chromatic graph homology 
of the cycle of length n is the Hochschild homology of algebra A = Z[x]/(x* = 0) 
through a range of dimensions increasing with n. 


Theorem 6.3.2 (Correspondence between Khovanov and chromatic homology). Let 
I(D,) be the graph associated to diagram D of a link L whose girth, the length of the 
shortest cycle, £ = &(G) > 1. Then the first £ Khovanov homology Kh(L) groups of L and 
chromatic graph homology HGR ,,(I(Da)) have isomorphic torsion [34]. 


This point of view enables us to describe some Z -torsion in the Khovanov ho- 
mology of certain classes of links via torsion in chromatic homology [2, 14, 34, 36] 
expressed in terms of combinatorial data of one of the corresponding graphs. For ex- 
ample, the torsion depends on the cyclomatic number that is the rank of the first ho- 
mology of that graph as a 1-dimensional CW-complex. 


Proposition 6.3.1. Given a diagram D of a knot or link L such that the corresponding 
graph I'(D,) isa connected graph with no loops. Let I’ (D,) be the graph obtained from 
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T'(Da) by removing all multiple edges and p,; = p:(I’ (Da)) be the cyclomatic number 
of I’(D,). Then 


Z», if f(D,) hasan odd cycle; 
torHy_2,n+2|D,, |-4(L) : ze i 


0, if I(D,) isa bipartite graph. 


fOr ipsoipnjca) ieee if (D4) has an odd cycle; 
n-4,n+2|Ds, |- 


Z, if (Da) isa bipartite graph. 


This proposition establishes Shumakovitch’s conjecture for all adequate and 
semi-adequate knots whose corresponding graphs have odd cycles. Chromatic homol- 
ogy, like Khovanov homology of alternating links, is supported only along two diag- 
onals [9], with torsion appearing only on one. An interesting computational problem 
would be to explicitly compute all torsion along the whole diagonal in chromatic ho- 
mology; pulling this information back via Theorem 6.3.2 would greatly extend our cur- 
rent knowledge of the torsion in Khovanov homology. 


Approach 3. Compute torsion via Khovanov tangle invariants 

In what we have described so far, computing Khovanov homology requires global 
knowledge of the topology of Kauffman states. Instead, one can take a different ap- 
proach and decompose a link diagram into local pieces called tangles. While it is 
straightforward to piece together tangles to obtain a link, it is quite challenging to 
piece together any of the existing Khovanov tangle invariants [21, 22, 4] to obtain the 
Khovanov homology of the link. Roberts [38, 39] constructs a Khovanov tangle invari- 
ant whose algebraic structure is inspired by the bordered Heegaard Floer homology. 
Most importantly, Roberts obtains gluing formulas which he uses to construct the Kho- 
vanov homology of a link from the Khovanov invariants of the tangles in its decom- 
position. This approach may be most suitable for studying the Khovanov homology of 
classes of links with especially nice tangle decompositions, such as almost-alternating 
links and closed 3-braids. 


Goal 2. Describe topology of the link in terms of torsion of Khovanov homology 
With a discovery of the new knot invariant one of the most intriguing questions is de- 
termining how much it can tell us about knots. In our framework, this translates into 
the question: what does torsion in Khovanov homology reveal about topological prop- 
erties of the link. As mentioned above, it is conjectured that the existence of torsion 
in the Khovanov homology of a knot implies the knot is nontrivial. Moreover, com- 
putations indicate a potential relationship between the existence of Zy-torsion in the 
Khovanov homology of a link and the braid index of the link. However, the infinite 
family of links with braid index four containing Zs -torsion with s < 23 constructed 
in [31] provides a counterexample to the PS-conjecture [36] and hints that such a rela- 
tion, if it exists, will be more subtle. 
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Fig. 6.4: Cobordisms of smoothings of a crossing. 
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6.4 Homological invariants of alternating and 
quasi-alternating cobordisms 


Khovanov homology is functorial, up to overall minus sign, under link cobordisms. 
However, in general, it is extremely difficult to compute homomorphisms of Khovanov 
homology induced by link cobordisms. 

Notice that the number of spanning trees s(I) ina graph I satisfies a recursive re- 
lation similar to the deletion-contraction formula for the chromatic polynomial (6.3.2): 


s() = si - e) + s(I/e). (6.4.1) 


Spanning trees of [(D,) play a fundamental role in link homology [8, 46]. The 
reduced Khovanov homology of L can be computed via a complex generated by span- 
ning trees of '(D,) but the combinatorial form of the differential is currently unknown. 
The same property holds for the knot Floer homology of L. When D is a minimal al- 
ternating diagram, differentials in both complexes are zero, and the rank of reduced 
Khovanov and knot Floer homologies of L equals the number of spanning trees in 
I(D,), and also the determinant of L [32]. 


Goal 3. Compute homomorphisms of Khovanov homology induced by “alter- 

nating" cobordisms between alternating links 

The spanning tree model gives a good inductive realization of Khovanov homology, 

and maps induced by alternating cobordisms could be effectively computable. 

The alternating cobordisms are those generated by the cobordisms illustrated in 6.4: 

they arise from smoothing a crossing, saddles, and births and deaths of circles. 
Khovanov-Rozansky s((n) link homology [24, 25] and its variation due to Mackaay, 

Stosi¢c, Vaz [28], are based on homology theories of planar graphs, functorial under 

foam cobordisms (in R x [0, 1]) between these graphs. Similarly one could search for 
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an interpretation of alternating cobordisms, which are realized by surfaces embedded 
in R*, in terms of cobordisms between checkerboard graphs of alternating diagrams. 
Cobordisms between planar graphs induced by alternating cobordisms will resemble 
foams. The same idea could be employed towards computing maps between link Floer 
homologies of alternating links induced by alternating cobordisms, or the maps in- 
duced by lifts of alternating cobordisms to the double branch cover. 


Goal 4. Find a combinatorial framework for quasi-alternating knots 
Quasi-alternating links are generalizations of alternating links, defined in an inductive 
way. 

The set 9 of quasi-alternating links is the smallest set of links including the unknot 
and satisfying the following property: Ifa link L has adiagram D witha crossing c such 
that both smoothings, Lo and Lj, of c are in Q, and det(L)=det(Lo)+det(L,), then L is 
in Q [33, 7, 29]. 

The notion of quasi-alternating link is hard to capture combinatorially. The initial 
search for quasi-alternating link families was performed using the knot theory soft- 
ware LinKnot [17, 16] and the computational results have already been used to show 
that certain knots and links are not quasi-alternating [12]. It would be very exciting to 
find a combinatorial model for some large class of quasi-alternating links, incorporat- 
ing inductive steps in their construction. Such a combinatorial model should also give 
a generalization of planar graphs (checkerboard graphs of alternating link diagrams) 
to more subtle higher-dimensional structures. 

Furthermore, the Khovanov and knot Floer homology of quasi-alternating links 
are easy to compute, due to the vanishing of the differentials in short exact sequences 
that build up to homology groups of these links. Quasi-alternating cobordisms be- 
tween quasi-alternating links can be defined as compositions of cobordisms built out 
of quasi-alternating crossings. 
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Abstract: A quadrisecant line is one which intersects a curve in at least four points, 
while an essential secant captures something about the knottedness of a knot. This 
survey article gives a brief history of these ideas, and shows how they may be applied 
to questions about the geometry of a knot via the total curvature, ropelength and dis- 
tortion of a knot. 
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7.1 Introduction 


In this survey article, we consider the ways in which lines intersect knots. Some of 
these capture the knottedness of a knot. Trisecant and quadrisecant lines are straight 
lines which intersect a knot in at least three, respectively four distinct places. It is 
clear that any closed curve has a 2-parameter family of secants. A simple dimension 
count shows there is a 1-parameter family of trisecants and that quadrisecants are 
discrete (O-parameter family). A planar circle unknot does not have any trisecants 
and quadrisecants, however nontrivial tame knots must have them. In general, we 
do not expect knots to have quintisecants (or higher order secants), they exist only for 
a codimension 1 (or higher) set of knots. Note that these statements are only valid for 
generic knots (for example, polygonal knots with extra conditions like no four points 
are coplanar). 

The existence of quadrisecants and essential secants gives insight into the geom- 
etry of knots. Section 7.2 gives definitions of quadrisecants and essential secants and 
a brief history of known results. It ends with a discussion of the open question of find- 
ing bounds on the number of quadrisecants for a given knot type. Section 7.3 gives a 
brief outline of the tools used in many of the proofs of results about quadrisecants. 
Section 7.4 shows how the ideas may be applied to results about the geometry of knots 
via the total curvature, second hull, ropelength and distortion of a knot. 
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7.2 Quadrisecants 


Recall that a knot is a homeomorphic image of S! in R*, modulo reparametrizations, 
and a tame knot is one that is ambient isotopic to a polygonal knot. 


Definition 7.2.1. Given a knot K, an n-secant line is an oriented line which intersects 
K in at least n components. An n-secant is an ordered n-tuple of points of K (no two of 
which lie in a common straight subarc of K) which lie in order along an n-secant line. 


As previously described, by a secant we mean a 2-secant, by a trisecant a 3-secant, 
and by a quadrisecant a 4-secant. For a closed curve K, any two distinct points de- 
termine a straight line. These points are a secant if and only if they do not lie ona 
common straight subarc of K. Thus the set of secants S = K? \ A is topologically an 
annulus. (Here A denotes the set of n-tuples in which some pair of points lie in a com- 
mon straight subarc of K.) 

We now consider the set of trisecants of a knot, denoted J c K? \ A. For any 
trisecant abc, there are |S3/C3| = 2 cyclic orderings of the oriented knot and trisecant. 
We could label these by their lexicographically least elements (abc or acb), but we call 
them direct and reversed respectively. Figure 7.1 illustrates the two types of trisecant. 
Flipping the orientation of the knot or the trisecant would change its type. 


@- 0 


Fig. 7.1: These trisecants are reversed (left) and direct (right) because the cyclic order of the points 
along K is acb and abc respectively. 


Just as with trisecants, we may compare the points of a quadrisecant line with their 
ordering along the knot. For quadrisecant abcd, the order along K is a cyclic order, 
and ignoring the orientation of K is just a dihedral order. Thus there are |S,/D,| = 3 
dihedral orderings of a quadrisecant and non-oriented knot. We can represent these 
equivalence classes by abcd, abdc and acbd, where we have again chosen the lexi- 
cographically least order as the name for each. Figure 7.2 illustrates these orderings. 


Definition 7.2.2. Quadrisecants of type acbd are called alternating quadrisecants. 
Quadrisecants of types abcd and abdc are called simple and flipped respectively. 
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Fig. 7.2: From left to right, quadrisecants abcd are simple, flipped and alternating. 


When discussing quadrisecant abcd, we will usually choose to orient K so that b € 
‘Yad: This means that the cyclic order of points along K will be abcd, abdc, or acbd, 
depending on the type of quadrisecant. 


7.2.1 Essential secants 


Before discussing known results about quadrisecants, we pause to introduce the no- 
tion of an essential secant. (The definitions in this subsection can all be found in 
Denne et al. [14].) G. Kuperberg [38] first introduced this idea (which he called “topo- 
logically nontrivial”) in his paper about quadrisecants. Being essential captures part 
of the knottedness of the knot. 

Generically, the knot K together with the secant segment between two points on 
K forms a knotted 9-graph in space (that is, a graph with three edges connecting the 
same two vertices). 


Definition 7.2.3. Suppose a, f and ¥ are three disjoint simple arcs from points a to b, 
forming a knotted @-graph, as illustrated in Figure 7.3. Let X := R? \ (aU y), and let 
6 be a parallel curve to a U B in X. (By parallel we mean that a U B and 6 cobound an 
annulus embedded in X.) We choose 6 to be homologically trivial in X (that is, so that 
6 has zero linking number with a U 7). Let h = h(a, B, y) © 72,(X) denote the (free) 
homotopy class of 6. Then (a, , +) is inessential if h is trivial. We say that (a, B, +) is 
essential if it is not inessential. 


In other words, the ordered triple (a, 8, +) is inessential if there is a disk D bounded by 
a UB having no interior intersections with the knot a U . (We allow self-intersections 
of D, and interior intersections with f, as will be necessary if a U B is knotted.) 

This notion is clearly a topological invariant of the (ambient isotopy) class of the 
knotted 0-graph. We apply this definition to arcs of a knot K below. But first, some 
useful notation. Let a, b € K. The arc from a to b following the orientation of the knot 
is denoted 7,, and has length ¢,,. The arc from b to a, y,, with length ¢,, is similarly 
defined. The secant segment from a to b is denoted ab. 
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Fig. 7.3: In the knotted O-graph aUB U7, the ordered triple (a, B, +) is essential. To see this, we find 
the parallel 6 to a U B which has linking number zero with a U ¥ and note that it is homotopically 
nontrivial in the knot complement R? \ (a U 4). In this illustration, B is the straight segment ab, so 
we may equally say that the arc a = +p of the knot a Uy is essential. 


Definition 7.2.4. Assume K is a nontrivial tame knot, a,b € K, and £ = ab. We say 
‘Yap is essential if for every € > 0 there exists some e-perturbation of ¢ (with endpoints 
fixed) to a tame curve / such that K U @ forms an embedded @ in which (yp, £ Ypq) 
is essential. 


Note that this definition is quite flexible, as it allows for the situation where K inter- 
sects £. In [14] Proposition 6.2, we show that it also ensures that the set of essential 
secants is closed in S. 


Definition 7.2.5. A secant ab of K is essential if both subarcs y,) and yp, are essential. 
Otherwise it is inessential. Let ES be the set of essential secants in S. 


It is straightforward to see that if K is an unknot, then any arc yg), is inessential. (Be- 
cause the homology and homotopy groups of X := R? \ K are equal for an unknot, 
then any curve having zero linking number with K is homotopically trivial in X.) We 
can use Dehn’s Lemma to prove a converse statement: if a,b € K and both y,, and 
Ybq are inessential, then K is unknotted. (See for instance [10, 14].) 

Later, in applications of these ideas, we need to find the least length of an essential 
arc, and use this to get better bounds on ropelength and distortion. This leads us to 
consider what happens when arcs change from inessential to essential. 


Theorem 7.2.6 ([14] Theorem 71). Suppose ~yac is in the boundary of the set of essen- 
tial arcs for a knot K. (That is, vac is essential, but there are inessential arcs of K with 
endpoints arbitrarily close to a and c.) Then K must intersect the interior of segment ac, 
and in fact there is some essential trisecant abc. 
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Finally, to call a quadrisecant abcd essential, we could follow Kuperberg and require 
that the secants ab, bc and cd all be essential. But instead, we require this only of 
those secants whose endpoints are consecutive along the knot 


Definition 7.2.7. An n-secant a,a)...dn is essential if we have a;a;,, essential for 
each i such that one of the arcs ya,a;,, aNd ~a;,,a; includes no other qj. 


That is, for simple quadrisecants, all three secants must be essential; for flipped 
quadrisecants the end secants ab and cd must be essential; for alternating quadrise- 
cants, the middle secant bc must be essential. 


7.2.2 Results about quadrisecants 


The simple dimension count outlined in Section 7.1 means that we expect nontrivial 
tame knots to have quadrisecants. Indeed, in 1933, E. Pannwitz [44] first showed that 
every nontrivial generic polygonal knot in R? has at least 2u” quadrisecants, where u 
is the unknotting number of a knot”". (It is entirely possible that Heinz Hopf suggested 
this problem to her. In the endnote to his paper, I. Fary [22] mentions that Hopf used 
quadrisecants to prove that knots have total curvature greater than or equal to 47.) 

In the early 1980’s, H.R. Morton and D.M.Q. Mond [40] rediscovered Pannwitz’ 
result. They independently proved that every nontrivial generic knot has a quadrise- 
cant, and they conjectured that a generic knot with crossing number n has at least () 
quadrisecants. It was not until 1994 that Kuperberg [38] managed to extend the re- 
sult and showed that all (nontrivial tame) knots in R? have a quadrisecant. To do this, 
he introduced the notion of an essential secant. In 1998, C. Schmitz [48] nearly proved 
that alternating quadrisecants exist for nontrivial tame knots in Hadamard manifolds, 
but in his proof some quadrisecants may degenerate to trisecants. 

In 2004, the following result was proved in my PhD thesis. 


Theorem 7.2.8 ((16]). All nontrivial tame knots have at least one alternating quadrise- 
cant. All nontrivial knots of finite total curvature have at least one essential alternating 
quadrisecant. 


Also in 2004, R. Budney, J. Conant, K.P. Scannell and D. Sinha [6] gave a geometric 
interpretation of the second order Vassiliev invariant. To do this, they used the tech- 
niques of compactified configuration spaces and Goodwillie calculus to show that this 
invariant can be computed by counting alternating quadrisecants with appropriate 


7.1 She actually stated her result in terms of the knottedness of K, which is the minimal number of 
singular points on the boundary of locally flat singular spanning disks of K, and is twice the unknotting 
number. 
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multiplicity (both for long knots and closed knots). This result implies the existence 
of alternating quadrisecants for many knots, while Theorem 7.2.8 shows existence for 
all nontrivial tame knots. 

Later in 2013, Budney’s Master’s student G. Flowers [25] gave a natural extension 
of the quadrisecant count formula found in [6] to closed knots. Here, Flowers counts 
five- and six-point cocircularities instead of counting quadrisecant lines. The relative 
ordering of the knot and intersecting circle are again central to the arguments, and 
the idea of an alternating quadrisecant becomes that of either a ‘satanic’ or ‘thelemic’ 
circle. 

In 2007, M. Sommer developed a wonderful jReality application Visualization in 
Geometric Knot Theory [50] as part of his Diploma Thesis [51]. This program allows the 
user to view the set of trisecants for polygonal knots, and even see how it changes as 
a vertex is moved. The program also shows the different projections of this set onto 
different planes, from which the user can read off the different quadrisecants. 

In 2008, T. Fiedler and V. Kurlin [24] viewed quadrisecants in a different way. They 
fixed a straight line in R? anda fibration around it by half-planes, and studied knots in 
general position with respect to this fibration using secants and quadrisecants lying 
in fibers. They give the minimum number of all fiber quadrisecants and fiber extreme 
secants which occur during an isotopy from one knot to another in terms of a sum of 
unordered and coordinated writhes of a particular kind of projection. 

In 2009, J. Viro [54] estimated from below the number of lines meeting each of four 
disjoint smooth curves in both RP? and R?. In R? her arguments may be translated 
to quadrisecant lines, however her count involves linking numbers and appears to be 
different from the one in [6]. 


7.2.3 Counting quadrisecants and quadrisecant approximations. 


For a generic nontrivial knot, we expect that there are a finite number of quadrise- 
cants. (Indeed, this was one of the results of Budney et al. [6].) The open and very 
challenging problem is to give bounds on the number of quadrisecants for a given knot 
type. Many of the papers cited in the previous section relate the number of quadrise- 
cants to some topological invariant of knots (unknotting number, finite type invari- 
ants, writhe), or give conjectures about what this might be. However none of the 
known results seem to be close to the number of observed quadrisecants for knots 
with large crossing number. Indeed, there are not even conjectures about asymptotic 
bounds for the number of quadrisecants for particular families of knots. 

There have been several papers which have grappled with this question for knots 
with small crossings. In 2005, G.T. Jin [35], gave trigonometric parametrizations of the 
31,41, 5, and 5> knots and found all quadrisecants for these parametrizations. He 
also defined the quadrisecant approximation of a knot K. 
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Definition 7.2.9. Let K be a knot with finitely many quadrisecants intersecting K in 
finitely many points. These points divide K into a finite number of subarcs. Replacing 
the subarcs by straight lines gives a polygonal closed curve called the quadrisecant 
approximation of K. 


Jin then conjectured the following. 


Conjecture 7.2.10 ([35]). Ifa knot K has finitely many quadrisecants, then the quadrise- 
cant approximation K has the knot type of K. Furthermore, K and K have the same set 
of quadrisecants. 


In 2011, Jin and S. Park [36] proved that every hexagonal (6-sided) trefoil knot has 
exactly three alternating quadrisecants. They then proved that the quadrisecant ap- 
proximation of a hexagonal trefoil knot is also a trefoil knot, and moreover has the 
same three alternating quadrisecants as the original knot. 

Most recently, S. Bai, C. Wang and J. Wang [3] proved that the Conjecture 7.2.10 
is false. They gave two examples of an unknot: the first has a quadrisecant approx- 
imation which is not even an embedded curve, and the second has a quadrisecant 
approximation which is a left-handed trefoil knot. They then used a particular type 
of connected sum operation to show that there is a polygonal knot in every knot type 
whose quadrisecant approximation is either not embedded, or contains a trefoil sum- 
mand. 

There are still many interesting questions to explore here. For example, it is en- 
tirely possible that Conjecture 7.2.10 is true for polygonal knots which have minimum 
stick number, or for ideal knots (those which minimize some kind of energy like rope- 
length). 

Returning to the question of counting quadrisecants, A. Cruz-Cota and T. Ramirez- 
Rosas [12] recently proved the first result giving an upper (rather than a lower) bound 
on the number of quadrisecants. 


Theorem 7.2.11 ((12]). Let K be a polygonal knot in general position, with exactly n 
edges. Then K has at most - (n - 3)(n - 4)(n — 5) generic quadrisecants. 


We finish this section by noting that the results of [6] are the only ones relating a count 
of alternating quadrisecants to a knot invariant. We have work-in-progress [13] show- 
ing that there is at least u(K) number of alternating quadrisecants of a knot K, where 
u(K) is the unknotting number of K. Examples from [35] have lead us to conjecture the 
following (say for knots of finite total curvature). 


Conjecture 7.2.12. The figure-8 knot has at least one essential flipped quadrisecant. 
The 52 knot has at least one essential simple quadrisecant. 
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7.3 Key ideas in showing quadrisecants exist 


Observe that quadrisecants are formed when several trisecants share common points. 
Quadrisecant abcd includes four trisecants (1)abc, (2)abd, (3)acd, (4)bcd. Pan- 
nwitz [44] showed quadrisecants exist by looking for pairs of trisecants like (1)abc 
and (3)acd, where the first and third points of trisecant abc are the same as the first 
and second points of trisecant acd. Kuperberg [38] showed that quadrisecants exist by 
looking for pairs of trisecants like (2)abd and (3)acd, where the first and third points 
of the trisecants are the same. We proved [16] that alternating quadrisecants exist by 
looking at families (1)abc and (2)abd, where the first and second points of the trise- 
cants are the same. The quadrisecant count in Budney et al. [6] boiled down to looking 
at linking numbers of these sets of trisecants in the 3rd associahedron (Stasheff poly- 
tope) — a compactified configuration space. 

All of these arguments presuppose that nontrivial knots have trisecants. Fortu- 
nately this is easy to prove, as was originally shown by Pannwitz. 


Lemma 7.3.1 ([44]). Each point of a nontrivial tame knot K is the first point of at least 
one trisecant. 


Proof. Suppose there is a point a € K which is not the start point of any trisecant. 
The union of all chords ab for b € K is a disk with boundary K. If two chords ab 
and ac intersect at a place other than a, then they overlap and one is a subinterval 
of another. They form a trisecant (abc or acb), contrary to the assumption. Thus the 
disk is embedded and K unknotted, a contradiction. 


The structure of the set of trisecants (and quadrisecants) can be understood more eas- 
ily when we restrict the class of knots considered, such that the restricted class is dense 
in the set of tame knots. All of the previous work on quadrisecants we have discussed 
does this, exactly how depends on the context chosen. We choose to work with polyg- 
onal knots with some extra assumptions. 


Definition 7.3.2. We say that the polygonal knot K in R? is generic if the following 
conditions are satisfied 
— No four vertices of K are coplanar, no three vertices of K are collinear. 
— Given three pairwise skew edges of K, no other edge of K is contained in the 
quadric surface generated by those edges. 
— There are no n-secants for n = 5. 


Once the existence of essential quadrisecants is established for generic knots, a limit 
argument (see [38]) is used to show that any nontrivial tame knot has a quadrisecant. 
The key point here is that the limit of an essential secant remains a secant — it does 
not degenerate in the limit. 
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7.3.1 Trisecants and quadrisecants. 


We now discuss how trisecants and quadrisecants can arise in generic polygonal 
knots. The first condition in Definition 73.2 means that three (or more) adjacent edges 
cannot be coplanar. Suppose e; and e;,; are adjacent edges. A 1-parameter family 
of trisecants arises when a third edge e; of the knot intersects certain parts of the 
plane spanned by e; and e;,,. This family is either homeomorphic to [0, 1] or [0, 1) 
depending on which region e; intersects (see Figure 7.4). A quadrisecant is formed 
when a fourth edge intersects one of the trisecant lines. Genericity implies that two 
non-adjacent edges cannot be coplanar, thus there can be at most one quadrisecant 
in this case. 


Fig. 7.4: The planes are spanned by edges e;, e;,,, On the left, there are trisecant lines intersecting 
€j, Cis: if the third edge intersects any one of the five regions marked (*), otherwise there are no 
trisecants. On the right, the family of trisecants intersecting e;, e;, @j41 is homeomorphic to [0, 1], 
while the family intersecting e;, ej11, e, is homeomorphic to [0, 1). 


Before moving on to the next case, we pause to remind the reader about some 
well known facts about doubly ruled surfaces (see for instance [31, 47, 43]). A triple of 
pairwise skew lines 1,, 15, 1; determines a unique quadric, a doubly ruled surface H (as 
in the blue lines in Figure 7.5). This is either a hyperbolic paraboloid, if the three lines 
are parallel to one plane, or a hyperboloid of one sheet, otherwise. Each point of H lies 
ona unique line from each ruling. The lines 1), 15, 13 belong to one of the rulings of the 
surface, and every line intersecting all three lines belongs to the other. Thus a fourth 
line 1, intersecting H yields two (or one) quadrisecant line(s) intersecting 1,, Io, 13, 
and |,. There are an infinite number of quadrisecants only when 1, is contained in H. 

Returning our attention to generic polygonal knots, we see that a pairwise skew 
triple of edges e;, e;, e, either has 0 or 1 trisecant, or has a 1-parameter family of trise- 
cants. A quadrisecant is formed when a fourth edge intersects the quadric in a trise- 
cant line. There are 1 or 2 quadrisecants in this case, since the second condition in 
Definition 7.3.2 means that there cannot be an infinite number of quadrisecants. 
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Fig. 7.5: Doubly ruled surfaces produced in Mathematica. On the left a hyperboloid of one sheet, on 
the right a hyperbolic paraboloid. The three blue lines 1, l2, /3 lie in one ruling, the cyan trisecant 
line in the other ruling. 


7.3.2 Structure of the set of trisecants. 


Using the ideas from above, we can piece together the structure of the set of trisecants 
JT c K?. Indeed this approach has been used to understand (and compute) J and 
the set of quadrisecants in many papers (such as [3, 12, 16, 36, 51]). We expect that 
generically, J is a 1-manifold. Of course, we can add more (generic) conditions to Def- 
inition 73.2 to give even more control over J (for example those in [16, 51]). We omit 
these details here, and also the many technical but straightforward details needed to 
prove the following two results. 


Proposition 7.3.3 (c.f. [16, 51]). Let K be a nontrivial generic polygonal tame knot. In 
K?, J is a compact 1-manifold with boundary, embedded in K? in a piecewise smooth 
way with T c K? \ Aand oT c A. Moreover, each component of J is either a simple 
closed curve or a simple open arc. 


Proposition 7.3.4 (c.f. [16]). Let K be anontrivial generic polygonal tame knot. The pro- 
jection mj; : K? - K? (i< j andi, j = 1, 2, 3) is a piecewise smooth immersion of J into 
the set of secants, and T = ny (T. ) intersects itself (transversally) at double points. 


The astute reader will have realized that we really need to work with essential trise- 
cants if we are to find essential (alternating) quadrisecants. Recall that the set of es- 
sential secants ES is a subset of S. In order to find essential alternating quadrisecants, 
we restrict our attention to the trisecants abc which are essential in the second seg- 
ment bc. Thus €T = 154(ES)NT, and we define €T 4 and €7" to be the sets of essential 
trisecants of direct and reversed orderings in K?. Since we are interested in finding 
essential trisecants which share the first two points, we project €J to Sina particular 
way. 
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Definition 7.3.5 ([16]). Let ET = 712(€7) be the projection of the set of essential trise- 
cants to the set of secants S and similarly define E T? := 14 2(ET a) and ET” := 142(€9"). 


We can easily prove that a version of Proposition 73.4 holds for ET: For a nontrivial 
generic polygonal tame knot, ET is a piecewise immersed 1-manifold which intersects 
itself transversally at double points ([16]). Finally, we observe that (essential) alter- 
nating quadrisecants occur precisely when direct and reversed (essential) trisecants 
intersect in S. 


Lemma 7.3.6 ((16]). Let ab ¢ ET¢N ET" inS. This means that there exists c and d such 
that abc € €3" and abd € E74. Then either abcd or abdc is an essential alternating 
quadrisecant. 


The proof of the existence of alternating quadrisecants, Theorem 7.2.8, follows the 
same pattern as the proof of Lemma 73.1. We proceed by contradiction, and assume 
that ET? ET’ = (in S. We then show that K has to be unknotted, a contradiction. 
The details of proving that K is unknotted are quite involved and so we omit them here. 
[However, very briefly, since K is a polygonal knot, we can show that ET“, respectively 
ET’, stay at least the minimum edge length away from the bottom, respectively top, 
boundary of S. Since ET4 and ET’ do not intersect, we use a Meyer-Vietoris argument 
to construct a loop winding once around S which avoids the set of essential trisecants. 
We then use this loop to find a spanning disk whose boundary is the knot. After further 
technicalities, the loop theorem is invoked to show that this disk is embedded.] 


7.4 Applications of essential secants and 
quadrisecants 


7.4.1 Total curvature 


For smooth closed curves, the total curvature can be thought of as the total angle 
through which the unit tangent vector turns (or the length of the tangent indicatrix). 
J.W. Milnor [39] defined the total curvature x(y) of an arbitrary closed curve y to be 
the supremal total curvature of inscribed polygons (where the total curvature is the 
sum of the exterior angles), and showed the two definitions are equivalent. In 1929, 
W. Fenchel [23] proved that the total curvature of a closed curve in R? is greater than 
or equal to 27, equality holding only for plane convex curves. In 1947, K. Borsuk [4] 
extended this result to R” and conjectured the following. 


Theorem 7.4.1. A nontrivial tame knot in R? has total curvature greater than 4n. 
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This result was first proved around 1949 by both Milnor [39] and Fary [22]. It has since 
become known as the Fary-Milnor theorem. In Cantarella et al. [10], the theorem was 
proved using the nonemptiness of the second hull for a knotted curve (defined be- 
low). In 1998, the Fary-Milnor theorem was independently extended to knotted curves 
in Hadamard” manifolds by Schmitz [48] and S.B. Alexander and R.L. Bishop [2]. In- 
terestingly, in their proof, Alexander and Bishop show that a simple curve of finite 
total curvature is isotopic to an inscribed (geodesic) polygonal curve, fixing a mistake 
in [39]. They show that for any nontrivial knot K, some doubly covered bigon is in- 
scribed in a polygon inscribed in K. This doubly covered bigon is an idea somewhat 
close to the midsegment of an alternating quadrisecant. 

A new proof of the Fary-Milnor theorem is given by the existence of alternating 
quadrisecants for any nontrivial tame knot K. An alternating quadrisecant can be 
thought of as an inscribed polygon in K, and has total curvature 47. Thus by defi- 
nition, x(K) = 47. To get a strict inequality, simply observe that a knot is not coplanar. 

More recently, H. Gerlach, P. Reiter, and H. von der Mosel [26] have written about 
elastic knots, which are limit configurations of energy minimizers of an energy consist- 
ing of the classic bending energy and a small multiple of ropelength (defined below). 
One of the many results in this paper is an extension of the classic Fary-Milnor theo- 
rem on total curvature to the C'-closure of the knot class. The proof of this result also 
relies on the existence of alternating quadrisecants. 


7.4.2 Second Hull 


The convex hull ofa connected set K in R? is characterized by the fact that every plane 
through a point in the hull must intersect K. If K is a closed curve, then a generic 
plane must intersect K an even number of times. Thus every generic plane through 
each point of the convex hull is cut by K at least twice. In proving the total curvature 
result above, Milnor observed that for a nontrivial tame knot, there are planes in ev- 
ery direction which cut the knot four times. More generally, there are points through 
which every plane cuts the knots four times. This idea was formalized by Cantarella, 
Kuperberg, R. Kusner and J.M. Sullivan in [10], where they defined these points as the 
second hull of a knot. 


Definition 7.4.2 ([10]). Let K be a closed curve in R?. Its nth hull hy(K) is the set of 
points p € R? such that K cuts every generic plane P through p at least 2n-times. 


Cantarella et al. [10] proved that the second hull of a nontrivial tame knot in R? is 
nonempty. This paper also conjectured the existence of alternating quadrisecants for 


7.2 A Hadamard manifold is a complete simply-connected Riemannian manifold with non-positive 
sectional curvature. 
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nontrivial tame knots in R?. The existence of alternating quadrisecants gives another 
way of proving that the second hull is nonempty. This is because the mid-segment bc 
of alternating quadrisecant abcd is in the second hull. 


7.4.3 Ropelength 


The ropelength problem asks to minimize the length of a knotted curve subject to 
maintaining an embedded tube of fixed diameter around the tube; this is a mathe- 
matical model of tying the knot tight in a rope of fixed thickness. 

More technically, the thickness T(K) of a space curve K is defined (O. Gonzalez and 
J.H. Maddocks [27]) to be twice the infimal radius r(a, b, c) of circles through any three 
distinct points of K. It is known ((11]) that t(K) = 0 unless K is C1’! (meaning that its 
tangent direction is a Lipschitz function of arclength). When K is C', we can define 
normal tubes around K, and then indeed 1(K) is the supremal diameter of such a tube 
that remains embedded. We note that in the existing literature thickness is sometimes 
defined to be the radius rather than diameter of this thick tube. Following others lead, 
we define the ropelength as follows. 


Definition 7.4.3. The ropelength of a knot K is the (scale-invariant) quotient of length 
over thickness, Rop(K) = len(K)/tT(K). 


Cantarella, Kusner and Sullivan [11] proved that any (tame) knot or link type has a ro- 
pelength minimizer and gave certain lower bounds for the ropelength of links; these 
are sharp in certain simple cases where each component of the link is planar. In this 
section we outline our joint work [14], showing how essential alternating quadrise- 
cants are used to prove that nontrivial knots have ropelength at least 15.66. This is an 
improvement on the bound of 12 from Y. Diao [18] and is greater than the conjectured 
bound of 15.25 from [11]. 


7.4.3.1 Ropelength basics 


Because the ropelength problem is scale invariant, we find it most convenient to 
rescale any knot K to have thickness (at least) 1. This implies that K is a C!! curve 
with curvature bounded above by 2. For any point a € R’, let B(a) denote the open 
unit ball centered at a. We now give several well known results about the local struc- 
ture of thick knots (see for instance [19, 11] and [14]). Most proofs are elementary and 
we omit them here. 


Lemma 7.4.4. Let K be a knot of unit thickness. If a € K, then B(a) contains a single 
unknotted arc of K; this arc has length at most 7 and is transverse to the nested spheres 
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centered at a. If ab isa secant of K with |a-b| < 1, then the ball of diameter ab intersects 
K ina single unknotted arc (either 7, OY Ypq) whose length is at most arcsin |a — b|. 


As an immediate corollary, we see that if K has unit thickness, a, b € K and p € yp 
with a, b ¢ B(p), then the complementary arc +, lies outside B(p). 


Lemma 7.4.5. Let K be a knot of unit thickness. If a € K, then the radial projection of 
K \ {a} to the unit sphere 0B(a) does not increase length. 


Lemma 7.4.6. Suppose K has unit thickness, and p,a,b € K with p ¢ yqp. Let Zapb 
be the angle between the vectors a - p and b - p. Then £gy = Zapb. In particular, if apb 
is a reversed trisecant in K, then 4) = 71. 


Given a thick knot K with quadrisecant abcd, we can bound its ropelength in terms of 
the distances along the quadrisecant line. Whenever we discuss such a quadrisecant, 
we will abbreviate these three distances as r := |a — b|, s := |b-c| and t := |c— d|. We 
start with some lower bounds for r, s and t for alternating quadrisecants. 


Lemma 7.4.7 ((14] Lemma 4.2). If abcd is an alternating quadrisecant for a knot of unit 
thickness, then r = 1 and t = 1. With the usual orientation, the entire arc qq thus lies 
outside B(b) U B(c). If s = 1 as well, then yac lies outside B(b) and ‘pq lies outside 
B(c). 


As suggested by the discussion above, we will often find ourselves in the situation 
where we have an arc of a knot known to stay outside a unit ball. We can compute 
exactly the minimum length of such an arc in terms of the following functions. 


Definition 7.4.8. For r > 1, let f(r) := Vr2 - 1+arcsin(1/r). For r,s > 1 and @ € [0, 7], 
the minimum length function is defined by 


Vr2+s52-2rscos@, if @< arccos(1/r) + arccos(1/s). 


m(r, Ss, 9) = 
f)+f(s)+(@-7), if @ = arccos(1/r) + arccos(1/s). 


The function f(r) will arise again in other situations. The function m was defined ex- 
actly to make the following bound sharp: 


Lemma 7.4.9. Any arc y from a to b staying outside B(p) has length at least 


fap = m(|a - p|, |b - p|, Zapb). 


An important special case is when @ = 7. If a and b lie at distances r and s along 
opposite rays from p (so that Zapb = 7) then the length of any arc from a to b avoiding 
B(p) is at least 


Lap 2 f(r) + f(s) = Vr2 - 1+ arcsin(1/r) + Vs2 — 1+ arcsin(1/s). 
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Fig. 7.6: The shortest arc from a to b avoiding the ball B(p) consists of straight segments and an arc 
of the ball. 


7.4.3.2 Length of essential secants and quadrisecants 


We will improve our previous ropelength bounds by getting bounds on the length of 
an essential arc. Intuitively, we expect an essential arc y,) of a knot to “wrap at least 
halfway around” some point on the complementary arc 7,,. Although when |a-b| = 2 
we can have ¢,, = 7, when |a — b| < 2 we expect a better lower bound for ¢,,. Even 
though in fact an essential +,, might instead “wrap around” some point on itself, we 
can still derive the desired bound. 


Lemma 7.4.10 ([14]). If yqp is an essential arc in a knot K of unit thickness, then |a-—b| = 
1 and ly = g(|a — b\), where 


2arcsin(|a-b|/2), ifOs|a-b| <2. 

g(a - bl) = ; 

Tl, if |a- b| = 2. 

Proof. (Sketch) If |a — b| < 1, then by Lemma 74.4 the ball of diameter ab contains a 
single unknotted arc of K, and thus +,,, is inessential. 

Knowing that sufficiently short arcs starting at any point a are inessential, con- 
sider the shortest arc aq which is essential. From Theorem 7.2.6 there is a trisecant 
apq with both secants ap and pq essential, thus a and q are outside B(p). Since ap is 
essential, apg is reversed and by Lemma 7.4.6 we get lg, > faq = 71. 

Note that |a — b| € [1, 2], so 2m - arcsin(|a — b|/2) < 52/3. Considering again 
‘aq With reversed trisecant apg, we have b ¢ aq and faq = 71, SO We may assume 
fqp < 21/3 or the bound is trivially satisfied. 

The remainder of the proof involves finding a bound on £,) using the fact that yqp 
is essential. There are two cases. The first is where b ¢ B(p) and the whole arc ygqp 
stays outside B(p). Thus @,, is greater than the length of the radial projection of the 
arc onto OB(p), giving the result. The second case is where b € B(p). Here, we let yay 
be the shortest essential arc starting at g and use a short argument (omitted) to show 
that 2.) > 57/3. 
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Fig. 7.7: The projection of yg, to the ball B(p) does not increase the length 2,5. 


It is now straightforward to compute a lower bound of the ropelength of a unit thick- 
ness knot K with an essential quadrisecant abcd. There are three cases, depending 
on the type of quadrisecant. 


1. When K has an essential simple quadrisecant, it has length 4, + ln¢ + ca + lag 2 
(g(r) + f(r) + (g(s) + s) + (g(t) + f(0). By minimizing each term separately, we find 
that K has ropelength at least 1077/3 + 2/3 +2 > 15.936. 

2. When K has an essential flipped quadrisecant, it has length ¢,) + nq + lac + ca 2 
(g(r) + f(r) + 2f(s) + (g() + f(0). By minimizing each term separately, we find that 
K has ropelength at least 1077/3 + 2\/3 > 13.936. 

3. When K has an essential alternating quadrisecant, the length of K is lact+£-,+pqt 
laa 2 2f(r) + (2f(s) + g(s) + s) + 2f(t). Again, by minimizing each term separately, 
we find K has ropelength at least 15.66. 


Together with Theorem 7.2.8, we conclude the following. 
Theorem 7.4.11 ((14]). Any nontrivial knot has ropelength at least 15.66. 


Several independent numerical simulations (see for instance [46, 52]) have found 
a trefoil knot with ropelength less than 16.374, which is presumably close to the min- 
imizer. This is about 5% greater than our bound, so there is not much room for im- 
provement, although a careful analysis based on tangent directions at b and c could 
yield a slightly better bound. 

Asa final remark, we note that the ropelength problem is still open for all knot and 
most link types — it is a rich source of open questions. For example, there are many 
results relating ropelength to other knot invariants, in particular to crossing number, 
for instance [5, 7, 20, 17, 21, 32]. More recently, there have been several papers giving 
a set of necessary and sufficient conditions for ropelength criticality, for example [49, 
8, 53, 9]. 
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7.4.4 Distortion 
Gromov introduced the notion of distortion for curves (see [28, 29, 30]). 


Definition 7.4.12. If 7 is a rectifiable curve in R?, then its distortion is defined to be 
the quantity: 
d,(p, q) 
6(y) = sup —LP* 21. 
- Bt dg3(p, q) 


where d., denotes the arclength along 7 and dg; denotes the Euclidean distance in R?. 


Gromov showed that for any simple closed curve 7, we have 6(y) = 51, with equality 
if and only if 7 is a circle, thus determining 6(unknot). He then asked whether every 
knot type can be built with say, 6 < 100. In [15], we proved that for any nontrivial tame 
knot we have 6(K) = 31. To do this, we first showed that any nontrivial tame knot has 
a shortest essential secant. Then, borderline-essential arcs and their lengths (namely 
Theorem 7.2.6 and Lemma 74.9) were key tools used in our distortion computations. 
Our bound is of course not sharp, but numerical simulations [41] have found a trefoil 
knot with distortion less than 7.16, so we are not too far off. We expect the true mini- 
mum distortion for a trefoil is closer to that upper bound than to our lower bound. 

In 2011, J. Pardon [45] proved that the distortion of Ty,g, a (p, q) torus knot, is 
6(Ty,q) = 1a min(p, q). This shows the answer to Gromov’s question is no. Pardon’s 
main theorem considers isotopy classes of simple loops in a piecewise-linear embed- 
ded surface of genus g = 1, and his inequality involves certain minimum geometric 
intersection numbers of these loops. 

In general, distortion is quite tricky to get a handle on. In [15] we give an example 
of a wild knot, the connected sum of infinitely many trefoils, with distortion less than 
10.7. Unlike the ropelength case, having finite distortion does not put any regularity 
conditions on the curve. Because of this, it is an open problem to establish the exis- 
tence of minimizers of 6 in any nontrivial knot class. In 2007, C. Mullikin [42] started to 
develop a calculus of variations theory for distortion. However, it is not clear whether 
the techniques developed for ropelength criticality can be easily applied to this situa- 
tion. 


7.4.5 Final Remarks 


Quadrisecants have made an appearance in other parts of knot theory as well. For ex- 
ample, they give a starting place to finding information about two “super-invariants” 
of knots. 

Recall that the bridge index of a knot is defined to be the minimum number of 
bridges in all possible diagrams of a knot in its knot type. The superbridge index was 
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first defined by N.H. Kuiper [37], and is 


sb|K] = min max bAK), 
K’e€[K] ves? 

where b;(K) is the number of bridges (or local maxima) of an orthogonal projection 
K — Rv. Kuiper then computed the superbridge index for all torus knots. Later, [33, 34] 
used quadrisecants to show that there are only finitely many knot types with super- 
bridge index 3. 

The supercrossing index of a knot is related to crossing index in the same way that 
superbridge index is related to bridge index. Namely, the supercrossing index of K is 


sc[K] = min max(# of crossings). 
Ke[K] ves? 


More simply, we maximize the number of crossings we see generated by a given con- 
formation of a knot, and then minimize over all conformations. This invariant was 
studied by C. Adams et al. [1] and was related to other knot invariants like stick index. 
The existence of quadrisecants means that every nontrivial tame knot has supercross- 
ing index at least 6. To see this, simply perturb the projection along the quadrisecant 
line to find six crossings. 

In summary, quadrisecants and essential secants appear in many different parts 
of knot theory. They are a useful tool in understanding many phenomena of knots 
and links, as they form a bridge between topological and geometric properties. For 
all this, quadrisecants are still not completely understood. One of the more important 
open problems is to give bounds on the number of quadrisecants for knot and link 
types. 
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Gyo Taek Jin 
Polygonal approximation of unknots by 
quadrisecants 


Abstract: If a knot K in R? has only finitely many quadrisecants which meet K at 
finitely many points, we use these points to form a polygonal approximation called 
the quadrisecant approximation which is denoted by K. The quadrisecant approxima- 
tion conjecture states that K has the knot type of K if K is nontrivial. We give examples 
of unknots, some polygonal and some smooth, having finitely many but at least two 
quadrisecants, for which the conjecture holds. 


Keywords: knots, quadrisecants, polygonal approximation 
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8.1 Introduction 


A knot K is the image of a locally flat embedding of S! in R?, i.e., we do not consider 
wild knots. A quadrisecant of a knot K is a straight line which meets K at four distinct 
places. The existence of quadrisecants for nontrivial knots was first announced by 
Pannwitz in 1933 for polygonal knots and later by Morton and Mond for smooth knots 
and by Kuperberg for tame knots. 


Fig. 8.1: A quadrisecant 


Theorem 8.1.1 ({10, 9, 8]). 
Every nontrivial knot has a quadrisecant. 


It is not hard to see that every knot can be perturbed to have infinitely many quadrise- 
cants. On the other hand, every knot can be perturbed to have only finitely many 
quadrisecants which meet the knot at finitely many points. 
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Theorem 8.1.2 ((1, 6]). 
A polygonal knot satisfying the following conditions has only finitely many quadrise- 
cants®", 

1. No four vertices are coplanar. 

2. No three edges are parallel to a single plane. 

3. No four edges lie on a single hyperboloid of one sheet. 


Definition 8.1.3. A polygonal knot is said to be in general position if it satisfies the 
three conditions mentioned in Theorem 8.1.2. 


8.2 Quadrisecant approximation of knots 


Definition 8.2.1. Let a : [a, b] + R? bea parametrization of a knot K and let a < t; < 
+++ <tn = to < b, for some n = 3. Then the polygonal closed curve 


K = alto)a(ty) U a(ty)a(tz) U +++ U a(ta_1)a(tn) 


is called a polygonal approximation, or simply, an approximation of the knot K. It may 
happen that K’ has one or more self intersection points. We say that the polygonal 
approximation K’ of K is a good approximation if K’ is a knot which has the knot type 
of K. 


Definition 8.2.2. Suppose that K is a knot having only finitely many quadrisecants 
which intersect K in finitely many points. The polygonal approximation of K using 
the points on the quadrisecants is called the quadrisecant approximation of K and is 
denoted by K. 


Fig. 8.2: A good approximation and a bad approximation 


8.1 Possibly no quadrisecants in the case of an unknot 
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Fig. 8.3: The quadrisecant approximation of a smooth trefoil knot 


Figure 8.3 shows that the quadrisecant approximation of a smooth trefoil is a good 
approximation and Figure 8.4 shows that the quadrisecant approximations of a polyg- 
onal trefoil and a polygonal figure eight knot are good approximations. 
Conjecture 8.2.3 (Quadrisecant approximation conjecture®’). 
Let K be a nontrivial knot having only finitely many quadrisecants which intersect K in 
finitely many points. Then 

1. Kisa good approximation of K. 

2. The quadrisecants of K are those of K. 


Theorem 8.2.4 ((7, 11]). 

The quadrisecant approximation conjecture holds for 
1. hexagonal trefoil knots and 
2. heptagonal figure eight knots in general position. 


Ay, $4 


Fig. 8.4: The quadrisecant approximations of a polygonal 3; and a polygonal 4, 


8.2 In the original statement of the conjecture in [6], the word ‘nontrivial’ is missing. 
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8.3 Quadrisecants of Polygonal Unknots 


In this part, we examined the part 1 of the quadrisecant approximation conjecture for 
133 polygonal unknots of which the vertex data were supplied by Claus Ernst [4]. It is 
explained in Section 8.5 how we obtained quadrisecants, and in Section 8.6 how we 
confirmed quadrisecant approximations are good. 


Example 8.3.1. We examined thirty 10-gonal unknots. They have 9.1 quadrisecants 
in average. Except the four cases of less than 2 quadrisecants, the quadrisecant ap- 
proximations are unknots. The quadrisecant approximation is not defined or not em- 
bedded if there exist less than two quadrisecants. Figure 8.6 shows two unknots which 
have no quadrisecant and only one quadrisecant. In each of Figures 8.7 and 8.8, the 
quadrisecants are shown on the left and the quadrisecant approximation is shown on 
the right. 


Io 
NR 


jo) 


10 20 30 


Fig. 8.5: Distribution of quadrisecants of thirty 10-gonal unknots 


Fig. 8.7: A 10-gonal unknot with 9 quadrisecants and its quadrisecant approximation 


Fig. 8.6: 10-gonal unknots with 0 and 1 quadrisecant 
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Fig. 8.8: A 10-gonal unknot with 28 quadrisecants and its quadrisecant approximation 


Example 8.3.2. We examined thirty 20-gonal unknots. They have 305.5 quadrise- 
cants in average. All quadrisecant approximations are unknots. Figures 8.10 and 8.11 
show two of them. 
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Fig. 8.9: Distribution of quadrisecants of thirty 20-gonal unknots 


Fig. 8.10: A 20-gonal unknot with 83 quadrisecants and its quadrisecant approximation 
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Fig. 8.11: A 20-gonal unknot with 764 quadrisecants and its quadrisecant approximation 


Example 8.3.3. We examined thirty 30-gonal unknots. They have 1577.4 quadrise- 
cants in average. All quadrisecant approximations are unknots. Figure 8.13 shows one 
of them; the knot, its quadrisecants, and its quadrisecant approximation with those 
quadrisecants incident to the new vertices. 
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Fig. 8.12: Distribution of quadrisecants of thirty 30-gonal unknots 


Fig. 8.13: A 30-gonal unknot with 1,431 quadrisecants and its quadrisecant approximation 
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Example 8.3.4. We examined thirty 40-gonal unknots. They have 5317.1 quadrise- 
cants in average. All quadrisecant approximations are unknots. Figure 8.15 shows one 
of them. 
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Fig. 8.14: Distribution of quadrisecants of thirty 40-gonal unknots 


Fig. 8.15: A 40-gonal unknot with 6,275 quadrisecants and its quadrisecant approximation 


Example 8.3.5. We examined thirteen 50-gonal unknots. They have 13161.4 
quadrisecants in average. All quadrisecant approximations are unknots. Figure 8.17 
shows one of them. 
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Fig. 8.16: Distribution of quadrisecants of thirteen 50-gonal unknots 
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Fig. 8.17: A 50-gonal unknot with 15,920 quadrisecants and its quadrisecant approximation 


8.4 Quadrisecants of Smooth Unknots 


We constructed seven smooth unknots to examine the part 1 of the quadrisecant ap- 
proximation conjecture. It is explained in Section 8.5 how we obtained quadrisecants, 
and in Section 8.6 how we confirmed quadrisecant approximations are good. 


Example 8.4.1. Figure 8.18 shows a smooth unknot, its quadrisecants and the 
quadrisecant approximation. With a = 1, 000 random choices of four points on this 
knot, we performed Newton’s method and obtained b = 184 quadrisecants. Reduc- 
ing duplicates, we obtained c = 99 quadrisecants as shown in the middle. Among the 
a-b = 816 failures, d = 27 cases diverged and e = 789 cases converged with less than 
four secant points. The approximation shown on the right is a good approximation. 


Fig. 8.18: A smooth unknot with 99 quadrisecants and its quadrisecant approximation 
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Example 8.4.2. Figure 8.19 shows a smooth unknot. We performed Newton’s method 
with a = 1,000, b = 143, c = 107, d = 28, and e = 829 as in Example 8.4.1, and 
obtained a good approximation. 


Fig. 8.19: A smooth unknot with 107 quadrisecants and its quadrisecant approximation 


Example 8.4.3. Figure 8.20 shows a smooth unknot. We performed Newton’s method 
witha = 1, 000, b = 54, c = 31,d = 19, ande = 927 asin Example 8.4.1, and obtained 
a good approximation. 


Fig. 8.20: A smooth unknot with 31 quadrisecants and its quadrisecant approximation 
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Example 8.4.4. Figure 8.21 shows the smooth unknot parametrized by 


x = (5 cos3t+4+cos2t) cos6t 
y = (5 cos3t+4+cos2t) sin6t O<t<2n 
Z=4 


sin2t-} sin3t 


We performed Newton’s method with a = 1, 000, b = 129, c = 51, d= 6, ande = 865 
as in Example 8.4.1, and obtained a good approximation. 


Fig. 8.21: A smooth unknot with 51 quadrisecants and its quadrisecant approximation 


Example 8.4.5. Figure 8.22 shows the smooth unknot parametrized by 


= (5 cos5t+4+cos3t)cos9t 


x= ( 
y= (4 cos5t+4+cos3t)sn9t Osts<2n 
z=4sin3t- 5 sin5t 

We performed Newton’s method with a = 1,000, b = 152, c = 121, d = 44, and 


e = 804 as in Example 8.4.1, and obtained a good approximation. 


Fig. 8.22: A smooth unknot with 121 quadrisecants and its quadrisecant approximation 
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Example 8.4.6. Figure 8.23 shows the smooth unknot parametrized by 
x = (5 cos8t+4+cos3t) cos12t 
y= (35 cos8t+4+cos3t)sinl2t O<t<2a 
z=4sin3t- 5 sin8t 


We performed Newton’s method with a = 1,000, b = 217, c = 197, d = 37, and 
e = 746 as in Example 8.4.1, and obtained a good approximation. 


Fig. 8.23: A smooth unknot with 197 quadrisecants and its quadrisecant approximation 


Example 8.4.7. Figure 8.24 shows the smooth unknot parametrized by 


x = (5 cos7t+4+cos2t) cos 10t 
y= (} cos7t+4+cos2t)sinl0t O<t<2a 
z=4sin2t- 5 sin7t 


We performed Newton’s method with a = 1,000, b = 145, c = 117, d = 46, and 
e = 809 as in Example 8.4.1, and obtained a good approximation. 


Fig. 8.24: A smooth unknot with 117 quadrisecants and its quadrisecant approximation 
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8.5 Finding Quadrisecants 
For a polygonal knot K, we parametrize four distinct edges by 
a, B, y,6:[0, 1) > R? 


which have constant tangent vectors coherent with an orientation of K. A line passing 
through a(t,) and B(t2) can be parametrized by 


qs) = (1-s)a(ti) + sB(tz2), seER. (8.5.1) 


It is a quadrisecant of K if it also intersects 7 and 6. If the system 


q(s3) = (1 - s3)a(ty) + $3B(t2) = (ts) 
q(s4) = (1 - Sa) a(t1) + S4B(t2) = 6(ts) 


can be solved for the six variables s3, s4 € R, fy, tz, t3, tg € [0, 1) then each solution 
determines a quadrisecant parametrized by (8.5.1). 


For a smooth knot K, we consider a parametrization by a differentiable map 
f : [a,b] — R? which is one-to-one for a < t < band f(a) = f(b). If there exist 
t1, t2, t3, t, € [a, b) such that the four points f(t,), f(t), F(ts), f(t4) are distinct and 
collinear, then there exist nonzero real numbers A and p satisfying 


F(ts) — f(t) = AF (2) - Ft), F(ta) - Flt) = w(t) - F(t) 


Therefore, finding a quadrisecant of K is equivalent to finding a zero of the vector val- 
ued map F = (F,, F), F3, Fy, Fs, Fg) : R®° > R® given by 

Fy(ti, ta, tz, ty, A, W) = x(t3) —Ax(t2) + (A- I)x(tr) 

Fo(t1, ta, t3, ty, A, u) = x(ty) — wx(t2) + (u- 1)x(tr) 

F3(t1, tz, t3, ty, A, W) = y(ts) — Ay(ta) + A- Dy(tr) 

Fy(ty, tz, t3, ta, A.W) = y(t4) - wy(ta) + (u- Dy(t) 

F5(t1, tz, 3, ty, A, uw) = 2(t3) —Az(to) +A - 1)2(tr) 

Fo(ti, t2, t3, ty, A, uw) = 2(ty) — w2(t2) + (u - 1)z(t) 
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where f(t) = (x(0), y(6), z() for t € [a, b]. With an initial guess vo € R° fora solution 
of the equation F(v) = 0, the sequence 


Vn+1 = Vn - (De(Wn)) *F(wn) 


may converge to a solution. This is Newton’s method we used to find quadrisecants in 
Examples 6-12. 


8.6 Test for Good Approximations 


Let K be a polygonal knot. If there is a triangular disk A which intersects K along two 
of its edges and nowhere else, then the portion of K along the two edges of A can be 
replaced by the third edge of A without changing the knot type. Such a move is called 
a triangular move if A contains three vertices of K, and a corner move if A contains a 
single vertex of K. If A contains two vertices of K, then it can be described as a combi- 
nation of a corner move followed by a triangular move. 


S » 
> > 


After finding each quadrisecant approximation, we obtained a sequence of trian- 
gular moves and corner moves to deform the polygonal curve into a triangle; which 
shows that the quadrisecant approximation is an unknot. We first scanned through 
the vertices at which the corner moves can be performed. If we ran out of such ver- 
tices, we broke the edges to perform corner moves. Such processes were successfully 
performed to result in triangles. 

We used Maple 18.02 to draw knots and unknots, to find quadrisecants, to con- 
struct quadrisecant approximations, and to find sequences of triangular moves and 
corner moves to show quadrisecant approximations are unknots. 


Fig. 8.25: A triangular move 


Fig. 8.26: A corner move 
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Appendix 
The vertices of the 10-gonal unknots of Figure 8.6 


(Left Figure) 

[0, 0, 0], 

[0.26079016715637726, —0.6651136228131733, 0.6997230576898867], 
[0.3851146726381711, 0.32376039919446953, 0.6180432809080222], 
[0.29607875594678595, —0.2674375016725 7857, —0.18355365493275347], 
[-0.5165348788026222, —0.4202409485147839, -0.7459682593417223], 
[-0.2605699687081926, 0.5310040460410358, -0.5738783841809526], 
[-0.21143605902512672, 0.489780688565 2303, 0.4240627431996551], 
[-0.9792224753363977, 0.030222606418956677, —0.02237917694442637], 
[-0.3139988528585777, 0.30446438577681256, 0.6720767097410592], 
[0.6397169293922946, 0.5647064005590116, 0.5214105210067966] 


(Right Figure) 

[0, 0, O], 

[0.8284426241853787, —0.13711967475775255, -0.5430294773096314], 
[0.45649991860657 285, 0.5694289821222773, 0.05900675019478009], 
[0.5844719512915223, -0.22266670353456428, —0.5378245 355253136], 
[-0.20529732546788465, 0.28931020656749296, —0.19997230005946198], 
[-0.5958278971220673, 0.28647492442354117, 0.7206132927990302], 
[0.3237832719482289, 0.3557541454361506, 0.3339407087475498], 
[-0.5730602105134658, 0.51354129523 70683, -0.07930859049406143], 
[-0.5605933955343954, —0.3916939036304804, 0.3454193380148927], 
[0.33521831640868194, —0.5627213045 367457, 0.7556278275477895] 
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The vertices of the 10-gonal unknot of Figure 8.7 


[0, 0, O], 

[0.2599759088879847, 0.7063479811963731, —0.658395820352525], 
[0.3232460020389661, —0.0708267 405320456, —0.03229970375458519], 
[-0.3890093612580253, -0.577239084649801, 0.45374436336261653], 
[0.5985360102269767, -0.43015793235373406, 0.39787600113666116], 
[-0.335615146452009, -0.4896887492626049, 0.7497533921601192], 
[0.16214824879081988, 0.2326112149861075, 0.269634277770117], 
[-0.17941933018462342, 0.5192489431721007, -0.6254469485849145], 
[0.10970548156622802, —0.073986685 30301361, 0.1258707820309488], 
[0.2805 221340395753, 0.872216847407 6153, 0.4006807 9990461647] 


The vertices of the 20-gonal unknot of Figure 8.10 


[0, 0, O], 

[0.8482169711790621, 0.40732568763520016, —0.338546531508343], 
[-0.08769695 736784196, 0.0951362437632568, —0.501650392550402], 
[-0.41095356729969773, —0.3391937227912085, 0.33910084550290054], 
[0.165599246487 20026, -0.2840335049210709, -0.4760949612079347], 
[0.5670984753772971, 0.3329930052222135, 0.2007186431035161], 
[-0.11690921121550236, -0.21262685423432814, 0.684900895616545], 
[-0.05162718976454717, 0.670660755319105, 0.22063668583437912], 
[0.22992403518773627, 0.24603905626447556, -0.6398430178308662], 

[-0.545 1058080348717, 0.13296592839047053, -0.01811702523675838], 
[-0.3829928667 3662986, —0.29010695 29831363, 0.8733585668631414], 
[0.07820219100325038, -0.5735108573538344, 0.032536739301705614], 
[-0.8321749557919547, -0.1981170757345217, -0.1415119875381466], 
[-0.15270625050450992, -0.7460520028999029, -0.6294562406319523], 
[-0.658560049051659, 0.0931762686547083, -0.4299372121688003], 
[-0.3760613614914784, 0.4254692083888686, 0.46993827573472363], 
[-0.18227772625757413, 0.6421489541499391, -0.48687821162972966], 
[0.6200378545 169324, 0.059516981894417724, -0.6166075211634959], 
[0.15289516739144102, -0.8197458643172172, -0.5234702185560548], 
[0.4288551720589901, 0.0641356787 1531958, -0.9010936999635384] 
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The vertices of the 30-gonal unknot of Figure 8.13 


[0, 0, O], 

[0.9559164467 618607, -0.25443934965439446, 0.1465754555088525], 
[0.02804655254704769, -0.3985822747 142186, -0.19734354158928644], 
[0.2230273288845479, 0.34936016038878287, 0.43713625370053855], 
[-0.1423832631217396, 0.7875966440994483, —0.3840968944337329], 
[0.30538251681348216, -0.10105737443120202, -0.4830910441009188], 
[0.3012109144597282, 0.8341007496167119, -0.12888505775824197], 
[0.3461536690536176, 0.2106052374462082, -0.9094191666303365], 
[0.13644196659122435, 0.867317947629769, —-0.18502326787490653], 
[0.6159415857389964, 0.06346331321990045, -0.5370145205715016], 
[-0.07459829470376207, -0.5849247 254798481, -0.21647181703702034], 
[-0.6456744097860264, -0.11645100799238642, 0.4576234631764124], 
[0.013065275565096495, 0.6158901155248228, 0.6300720228951164], 
[0.3257452740151201, —0.3163186077957065, 0.44781364933054646], 
[-0.3692232349170414, 0.3650512940457395, 0.21813163336676566], 
[0.5962648581644622, 0.522606432033769, 0.4255180374809578], 
[0.4216649068013401, -0.4357824399672629, 0.199673783605449 16], 
[0.6891481900165033, 0.5134219687 382546, 0.03395207804324586], 
[0.48005 323271764483, —0.349863278345 78396, 0.4933189042282012], 
[-0.07222465637064997, -0.58697 13688224925, -0.30591127954532604], 
—0.1301082479344901, 0.2085509519106162, 0.29724187434104754], 
0.485784516741387, 0.19721675867335445, -0.49050658708783085], 
-0.41900114810318617, 0.2874375363705996, -0.0743057 7982832054], 
—0.5142445788646942, -0.6836888594256834, —0.2930344068630529], 
-0.136206867 45409896, -0.62007 14061477229, 0.6305674058191582], 
[0.49847904544111116, —0.324346799305082, -0.0833798953734013], 
[0.1741444901061093, 0.6016937970515704, -0.2763977963957214], 
[-0.7272768925103164, 0.5408356549693376, 0.1522463188143096], 
[-0.24741374056874957, -0.3060880701790464, 0.38126937265275906], 
[0.646115226201013, —0.7282664692363255, 0.2283923471950623] 
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Open knotting 


Abstract: Traditional topological knot theory mainly studies knotting within closed 
curves. However, much of what people consider knotting in everyday life, e.g. in 
shoelaces, involve tangled open rope-like materials with two free ends. Furthermore, 
knotting has been discovered in open DNA and protein chains. Several notions of open 
knotting have been defined and applied in different situations. In this paper, we sur- 
vey the different techniques used for measuring knotting in open chains, including 
some discussion of different properties of the techniques. We present the knotting fin- 
gerprint, a means for visualizing the knotting of subchains of open and closed chains 
via image matrices. Finally, we show image matrices for some random closed knots 
and relate features of the matrices with features of the configurations. 


9.1 Introduction 


From tying shoelaces to detangling headphone cables, knotting is ever-present in 
modern day life. Knotting can be decorative (as in Celtic art and architecture), pro- 
vide utility (as in boating applications and the manufacturing of clothing), and be 
used for safety (as in recreational climbing and surgical sutures). In addition to the 
human scale, knotting is present at microscopic scales (in DNA [35, 37] and proteins 
[3, 18, 26, 38, 44]), and may even be present at the subatomic scale in tightly knot- 
ted glueballs [5, 6, 7]. In many of these applications, the knotting is in an open arc 
(which we will call an open chain), i.e. the knotting occurs in a non self-intersecting 
arc with two free ends. Traditional knot theory, however, works predominantly with 
closed curves, i.e. curves that form loops. So while there is a long and rich history of 
studying knots mathematically, little of that work involves open chain knotting which 
might, indeed, be most relevant scientifically. 

Knotting has been studied mathematically since the 1800s with Gauss’s integral 
to measure the amount of interlinking between two loops [15]. Perhaps most famously, 
Lord Kelvin proposed that atoms were vortices in the ether (a hypothetical material 
that was thought to permeate the universe) forming different types of knots [45]. This 
hypothesis (which, ultimately, was discredited) inspired Peter Guthrie Tait to develop 
the first table of mathematical knots [40, 41, 42]. A mathematical knot is a simple 
closed curve, a one-dimensional non self-intersecting loop with no free ends. Given a 
mathematical knot, one can deform space (including the knot) continuously to obtain 
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equivalent knots. These topological deformations are akin to manipulating a piece of 
knotted rope, although the “rope” is infinitely stretchy, infinitely thin, and cannot be 
fully pulled tight. This equivalence then divides mathematical knots into classes of 
curves called knot types, which are the objects studied in knot tables. 

Applying the standard knot theory equivalence to open chains (with two free 
ends) turns out to be quite unrewarding since any open chain can be deformed topo- 
logically to a straight line segment, and thus all open curves are equivalent. So while 
we may perceive knotting in an open curve, the traditional mathematical techniques 
are not capable of classifying that entanglement. Still, as humans, we know knots 
when we see them, even if the curve happens to be open. This human intuitive notion 
of knotting was originally referred to as the “reasonable-person test” by Mansfield [25]. 
The question is how to take this human intuitive notion of knotting and make it con- 
sistent with the mathematical theory. In other words, how can we define knotting in 
open chains so that it coincides with the opinion of a reasonable person? 

While it is interesting, intellectually, to consider how one might measure knotting 
in open curves, the desire to formally analyze these structures has been enhanced by 
the discovery of entanglements within proteins. In particular, anyone with an internet 
connection now has access to the geometric structure of a dizzying number of proteins 
via the Protein Data Bank (PDB) [2]. Currently, the KnotProt database [18] lists over 
1200 structures that have been classified as containing some sort of knotting (although 
we wait for the next section to discuss more specifically about how knotting is detected 
in proteins). Proteins are chains of (typically hundreds of) amino acids. Each amino 
acid contains an alpha carbon. When researchers talk about knotting in proteins, they 
are really talking about knotting within the alpha carbon backbone (i.e. the open chain 
polygon with alpha carbons as vertices and edges joining alpha carbons whose amino 
acids are joined by peptide bonds). 

To provide a brief history of knotting in proteins, the first knotted protein was dis- 
covered by Richardson in 1977 [34]. In 1994, Mansfield surveyed known protein struc- 
tures (~ 400 from the Brookhaven Protein Database) [25] in an attempt to find knots, 
but concluded “In summary, none of the 400 protein structures analyzed were found 
to have knots. Only one, human carbonic anhydrase B, comes close.” He also ques- 
tioned how one should define knotting in proteins, since proteins are open chains. In 
1997, Mansfield used a different technique [26] to identify two knotted proteins (one 
being Richardson’s knot [34], the other having been identified by Takusagawa and 
Kamitori in 1996 [43]). However, both of these knots were “shallow”, i.e. the removal 
of a few amino acids on either end would remove the knot. Later, Taylor [44] discov- 
ered a “deep” knot (meaning that the knotting occurs away from the endpoints, deep 
into the protein), which cemented the idea that knotting could exist in proteins. The 
most complicated knot seen to date is the Stevedore knot, 6,, from [3]. See [12] fora 
recent survey on knotting in proteins. 

In Mansfield’s second paper on protein knotting [26], he made use of the fact that 
the ends of the proteins are on or near the “exterior” (i.e. the endpoints lie on the 
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convex hull), in which case one can unobtrusively close the protein chain via a simple 
arc that avoids the entangled portion (as in Figure 9.1A). If one were interested only 
in classifying knotting in such open chains, then the classification is straightforward 
and relatively unambiguous. 

However, the knotting of an entire structure may not reveal all of the entanglement 
within a curve. The primary example of this behavior is with slipknots, where the en- 
tire chain is unknotted but there are subarcs of the chain (which we call subchains) 
which are knotted. Yeates’s group did the first PDB search for slipknots, detecting 37 
slipknotted protein chains [20]. Currently, the KnotProt [18] database has cataloged 
over 400 slipknotted structures in the PDB. When analyzing subchains, we necessar- 
ily enter into situations where the endpoints are no longer on the “outside”, but rather 
are embedded directly in the complexity that defines the entanglement. In such cases, 
the classification of the knotting in the open chain is more ambiguous, oftentimes very 
ambiguous. 

The purpose of this note is to survey the different notions that have been proposed 
for measuring knotting in open chains. In Section 2, we introduce the several different 
definitions and discuss some of the advantages and disadvantages of each approach. 
In Section 3, we present image matrices which communicate the knotting seen within 
subchains of open and closed chains. In Section 4, we show some examples of im- 
age matrices for random closed chains and discuss some of the relationships between 
features of the image matrices and features of the corresponding configurations. 


9.2 Defining open knotting 


We should begin by saying that there is no universally agreed upon definition for knot- 
ting in open chains. Knotting traditionally has been studied on closed curves and an- 
alyzed topologically, in which case the equivalence allows a sort of movement of the 
configurations. For open chains, we enter into more of a static state where small per- 
turbations of a configuration can immensely change our perception of the knotting 
present in the curve. This paradigm shift is not unprecedented; the study of knot- 
ting in physical systems is replete with examples of searching for and studying certain 
special configurations, e.g. the minimizing configurations of the Mobius Energy (see 
e.g. [14, 22, 31]) or ropelength (see e.g. [4, 16, 23]). One of the reasons that there is no 
agreed upon definition is because different research groups from different disciplines 
have had different goals at different times. Currently, a main (and classic) tension is 
the balance between a definition having attractive properties versus being quick to 
compute. Having a “bad” definition that can be computed easily is not optimal, nor is 
having a “good” definition which is incomputable. So a compromise must be reached 
between the speed of computability and acceptance of limitations in the approach, 
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A B C 


Fig. 9.1: Three examples of open chains that should be classified as being knotted by a reasonable 
classification scheme. The example in A is what most people would consider to be a knot. This is 
known as an overhand knot and when closed in a simple fashion becomes a trefoil knot, 31. In B 
and C, we have closed knots from which a small part of the curve has been removed. Here again, we 
would expect a reasonable classification scheme for open chains to count these as being knotted, 
although the more precarious positioning of the opening in C may make us think that it is not as 
robustly knotted as B or A. 


and that delicate balance leads to different opinions depending on the goals of the 
research. 

In an attempt to define a mathematical notion of knotting for open chains, we be- 
gin by imagining properties that one might like the definition to satisfy. We discuss 
four such properties. First, the definition should be consistent with our intuition. In 
particular, the classification of the knotting in open entangled chains should roughly 
match the opinion of a reasonable person. For example, an overhand knot pulled 
tightly should correspond to a trefoil knot (as in Figure 9.1A). Second, open knotting 
should converge in some fashion to the classical knot theory of closed curves. In other 
words, by removing an arbitrarily short portion of a closed knot (see Figure 9.1, B and 
C), the definition should classify the open chain as the same knot type as the knot type 
of the original closed knot. Third, the definition should be well-defined, computable 
on a computer (at least for polygons), and not rely on any human interaction. Fourth, 
the classification should be stable in some sense. For example, an arbitrarily small 
perturbation of the endpoints of a given chain should not change the classification. 
Furthermore, a good classification scheme should minimize the effect of small per- 
turbations on the classification. Millett mentioned two of these properties in [29] (he 
called the second property “continuity” and the fourth property “robustness”). 

Specifically, our goal should be to define a notion of open knotting that is applica- 
ble to all open chains, not just a subclass of open chains (like full protein-like chains). 

Researchers have defined several notions of knotting for open curves. In the re- 
mainder of this section, we introduce several different techniques used to measure 
knotting in open chains and discuss some of the properties of these techniques. In 
particular, we focus on how these algorithms behave on polygons, since this is the 
class of curves on which the algorithms are typically applied. All of the techniques de- 
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fined to date have relied (to some extent, at least) on closing the open chain in one or 
more ways and using the knot type classifications from traditional mathematical knot 
theory. We discuss two different classes of techniques, single closure and stochastic, 
as well as some related techniques and the traditional topological approach to study- 
ing knotted arcs in the subsections below. Figures 9.6 and 9.7 show examples of the 
single closure and stochastic techniques applied to a common open chain. 


9.2.1 Single closure techniques 


For the single closure techniques, a single closed knot is generated for a given open 
chain, and the knot type of the open chain is classified as being the knot type of this 
closed knot. The techniques in this class have the advantage that they are quick to 
compute. For chains with the endpoints near the “outside”, e.g. most proteins, the 
single closure techniques (with the exception of direct closure) are consistent with 
the more involved techniques of the following section. When the endpoints are well 
within the convex hull of an open chain, the classifications via single closure tech- 
niques could very well anger our reasonable person, i.e. the knot type classifications 
of open chains do not match our intuition. Since there is no agreed upon definition of 
open knotting, these classifications are not “wrong” per se, but we might be able to do 
better in some situations. However, for the evaluation of knotting within large samples 
of open chains, the statistical differences between single closure techniques and more 
complicated schemes may justify (depending on one’s goals) the speedup gained by 
employing single closure techniques. For example, in [19], the authors estimated the 
error in using a single closure technique for 500-edge freely fluctuating polygons by 
measuring the probability of finding a “knotted domain” within subarcs of random 
closed unknots. They report an estimated ~ 3% error, although we note that some of 
these knotted domains within the unknots may not actually be “errors” but rather de- 
tected slipknotting. Using a single “topological neutral closure” and the same error 
estimation on lattice polygons of length 500, an error of = 0.2% was reported in [27]. 
Because of the self-avoiding nature of lattice polygons (i.e. the chains have an inherent 
thickness and, thus, tend to be more spread out), one might expect smaller errors in 
analyzing lattice polygons. As with the case of freely fluctuating chains, the algorithms 
may be detecting slipknotting. Also, lattice polygons with 500 edges are considerably 
more constrained than 500-edge freely fluctuating polygons, so these errors should 
not be compared directly. It is not clear exactly how to extrapolate the errors to other 
types of knots or to polygons with more edges, but the reported errors suggest that 
single closure techniques can be effective if one is willing to tolerate some error. 
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Fig. 9.2: Direct closure oftentimes does not match our intuition for knotting, even when the end- 
points lie on the convex hull. The closing segment (in blue) can seemingly remove (A) or add (B) 
knotting. 


9.2.1.1 Direct closure 


The easiest way to close an open chain is to simply connect the two endpoints with 
a straight line segment. The main disadvantage is that the closing line segment can 
interact with the knotting (and can be much longer than the segment length of the 
polygons), and thus can introduce knotting or remove knotting that, intuitively, one 
expects the technique to capture. Arguably the simplest example of real-life knotting 
is shown in Figure 9.1A where an overhand knot has been tied and the ends have 
been pulled away from the entangled portion of the curve. Most reasonable people 
would say that this configuration is knotted. However, by joining the endpoints with 
a straight line segment, one obtains an unknot (see Figure 9.2A). Similarly, one can 
construct configurations that most people would classify as unknotted but whose di- 
rect closure is arbitrarily complex (see Figure 9.2B). In particular, this technique often 
does not match our intuition for open chains whose endpoints lie on the “outside” 
of the chain. Note that no one has used this technique as is and we mention it here 
only for its simplicity. However, in a future paper we show that direct closure actually 
coincides with other notions more often than one might expect. 
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Fig. 9.3: For the center of mass technique, a subchain obtained by removing an arbitrarily short 
arc from a closed chain can yield a different knot type than the closed chain. In this example, the 
subchain of the KnotPlot 4; knot yields an unknot when the center of mass technique is employed. 


9.2.1.2 Center of mass 


For this technique [24], the center of mass of the open chain (the mass of which is 
assumed to be equally distributed at the vertices of the polygon) is computed. Two rays 
are then created, starting at the two endpoints of the chain. The direction in which the 
ray extends from each endpoint is given by the vector from the center of mass to that 
endpoint. The closed knot is defined by connecting the rays at infinity. On a practical 
note, we really do not need to connect the rays at infinity: once the rays have pierced 
the convex hull of the chain, we can cut the rays and connect via a simple polygonal 
arc that lies outside of the convex hull. This technique, at least in part, is an attempt 
to remedy the problems of direct closure with the examples of Figure 9.2, and indeed 
this technique is successful in doing so. 

One disadvantage of this technique is that when an endpoint of the chain is near 
the center of mass, a small perturbation of the endpoint will wildly affect the direc- 
tion of its associate ray, and thus the perceived knot type. In the non-generic situation 
where the endpoint is at the center of mass, the technique is undefined. Also, there 
are a number of situations where this technique might not match our intuition or con- 
verge to standard knot theory. For example, in Figure 9.3, a closed 4; knot has a small 
portion of the curve removed and is classified as an unknot. 
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9.2.1.3 Minimal interference 


The minimal interference technique [46] might be thought of as an attempt to wed the 
best of the previous two techniques. A main problematic situation for direct closure 
is when the two endpoints are far away from each other. On the other hand, it seems 
somewhat silly to extend rays to infinity (as in the center of mass technique) when the 
two endpoints are close together. In the minimal interference technique, the minimum 
distance from each endpoint to the convex hull is computed. If the sum of those two 
distances is less than the distance between the endpoints, then edges are added from 
the endpoints to the closest points on the convex hull and the chain is closed by adding 
asimplearc on the outside of the convex hull between these points on the convex hull. 
If the distance between the endpoints is smaller than the sum of the two distances to 
the convex hull, then direct closure is used. 

From a practical standpoint, this technique seems quite reasonable. The main 
problem comes from instability and not being well-defined. First, the distance from 
a point to the convex hull need not be realized uniquely, and the knots obtained by 
connecting to the different faces, in these cases, need not all have the same knot type. 
So defining the knot type via the closure on the convex hull is not well-defined. Note, 
however, this property is non-generic, and so this is not likely to be a problem com- 
putationally. Second, when the distance between the endpoints is equal to the sum of 
the distances to the convex hull, one has to decide which of the two choices to make, 
although, again this is a non-generic property. Third, when the sum of the distances 
to the convex hull is very close to the distance between the endpoints, a small pertur- 
bation of the chain can create a transition between the two protocols. For example, 
in Figure 9.4, we have removed a small portion of a KnotPlot 9 knot [36] where the 
straight line distance and sum of the distances to the convex hull are nearly equal. The 
direct closure creates a 9» knot, but the arc proceeding to the convex hull creates an 
unknot. Similarly, we could create configurations where a small perturbation creates 
an arbitrarily large difference in the complexity detected by the two protocols. 


9.2.1.4 Simplification schemes 


For the analysis of protein knotting, one of the earliest (and most popular) techniques 
involves simplifying the chain and is referred to as the KMT algorithm [21, 44] (al- 
though the ideas trace back at least to [33]). Protein chains are complicated. Trying 
to find a knot visually in the alpha carbon backbone will quickly leave the viewer with 
a headache. The goal of the simplification schemes is to eliminate unnecessary detail, 
reducing the number of vertices and edges to create an equivalent, yet much shorter, 
chain which is much easier on the human visual cortex. This simplication is accom- 
plished by searching for triangles formed by consecutive edges that are not pierced by 
any edge of the chain. When such a triangle is not pierced, one can remove the mid- 


184 — Erin Brine-Doyle, Madeline Shogren, Emily Vecchia, and Eric J. Rawdon 


» Zea Pro 
os 


SZ 


& 


Fig. 9.4: For the minimal interference technique, a small perturbation of the vertices can lead to 
large differences in the knot detected. In this example of a KnotPlot 92 knot [36], a 92 knot is de- 
tected when closed directly (left, red arc), but an unknot, 01, is detected when the arc is closed via 
proceeding to the convex hull (right, blue arc). 


dle vertex of the consecutive edges and replace the pair of consecutive edges with one 
edge between the two remaining endpoints (see Figure 9.5). For polygons whose end- 
points are on the “outside” (as is typically the case in proteins), this scheme seems to 
work quite well. One (usually) obtains a simple chain and connecting the endpoints 
via a large external loop provides a closed knot for classification. However, when the 
endpoints are not on the “outside”, such as in the search for slipknots or any sort of 
analysis of knotting in subchains, simplification is not effective. Furthermore, the or- 
der in which the delta moves are performed can affect the knot type classification of 
the chain, even when the endpoints are on the outside [28]. As such, classification 
by simplification is not well-defined mathematically and, thus, is not a good choice 
for classifying knotting in all open chains (although, to be fair, it was not designed to 
perform that task). 


9.2.2 Stochastic techniques 


For the stochastic techniques, the overall philosophy is that the open chain is an in- 
complete form, having the potential to be mathematically knotted but not, at this 
stage, having achieved its potential. The open chain is seen as a subchain of a class 
of closed knot configurations, where the class of configurations depends on the tech- 
nique. As such, the “knot type” of the chain is a probability distribution of knot types 
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Fig. 9.5: A delta move can simplify a configuration to make it easier to see the underlying knotting in 
a chain. If the shaded triangle is not pierced by any edge from the polygon, then the two edges can 
be replaced with the dashed edge. Note, however, that Millett et al. [28] showed that the order of 
the delta moves can affect the knot type classification of open chains. 


from the given class, and this probability distribution is approximated by sampling 
from the distribution. The stochastic techniques match our four criteria quite well. 
The main disadvantages are that 1) we must approximate the probability distribution, 
and 2) the algorithms can be orders of magnitude slower than the single closure tech- 
niques. Still, the authors hope to convince the reader that the stochastic techniques 
are the right way to think about open knotting. 


9.2.2.1 Uniform closure 


The idea here is to extend parallel rays from the endpoints of an open chain to infinity 
(like in the center of mass technique) and connect the rays at infinity. In particular, 
for a given open chain and a given direction (seen as a point on the unit sphere S7), 
parallel rays are extended from both of the endpoints in the given direction and the 
configuration is closed at infinity. The “knot type” of the open chain is then the prob- 
ability distribution of knot types obtained by completing this closure over all points 
on S*. Millett, Dobay, and Stasiak [28] defined the original version of this technique 
with a slight alteration: in their computations, they chose points on a large sphere 
containing the open chain. The differences between connecting at infinity and viaa 
large sphere are likely negligible. Still, connecting to a large sphere requires the choice 
of a center for the sphere and a radius and, therefore, requires some human choices 
that we would like to avoid [30]. 

Note that one could choose to send the rays to infinity in independent directions 
(S* x S* worth of choices) instead of one common direction (S? worth of choices). 
In fact, S* x S* was Mansfield’s choice in his first knotted protein paper [25]. In 
[17, 18, 32, 38], we use one common direction. This decision is due mostly to computa- 
tion speed. Typically, we have used 100 closure directions to obtain our approximation 
of the probability distribution. Sampling from a probability distribution, instead of us- 
ing a single closure, already puts the stochastic techniques at a significant disadvan- 
tage speed-wise. To get a similar density of sampling on S* x S?, we would need 1002 
sample directions. There might be some advantages of sampling from S? x S? in terms 
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of matching our intuition for knotting, although, to date, no one has systematically 
analyzed the differences. 

Also, there are some historical reasons for using common directions. In particular, 
the average crossing number is computed by averaging the number of crossings seen 
as a configuration is projected over all directions in S* (writhe is another example). 
Therefore, using common directions is akin to computing the average knotting of the 
given chain. 

Because this is the technique we have used in [17, 18, 32, 38], we discuss briefly 
its implementation. We use a fixed set of 100 directions (i.e. 100 points on S*) that are 
roughly uniformly spread over the unit sphere. The early papers (e.g. [24, 25, 28]) used 
random directions. Unfortunately, random is seldom uniform and the goal in sam- 
pling the directions is to get a good approximation of the probability distribution of 
knot types. So we have chosen uniform fixed directions (which also guarantees con- 
sistent results). We have computed the spherical Vornoi cells associated with these 
fixed directions and we weight the knot types obtained from the closure in a given 
direction by the relative surface area of the Vornoi cell for the corresponding point 
on S?. The choice of 100 directions is somewhat arbitrary, but we did do some analy- 
sis of the effect. More specifically, in the summer of 2012 undergraduate University of 
St. Thomas students Nicole Lopez and Elizabeth Annoni explored the effect of using 
different numbers of closures and different algorithms for generating roughly uniform 
sets of points on the sphere in measuring the knotting of open chains. Without getting 
into the details, we found that the roughly uniform set of points using Martin’s Poly- 
hedra [1] has good uniformity and using more than 100 closures has a negligible effect 
on the approximation of the probability distribution. The students also computed the 
relative surface area of the Vornoi cells that we use. In addition, as mentioned above, 
we do not really close the knot at infinity. We compute the maximum distance between 
any two vertices on the knot, and then extend the rays twice that distance in the given 
directions and connect with an arc of line segments that avoids piercing through any 
of the entanglement. 

Once one obtains an approximation of the probability distribution, there is still 
the question of what “knot type” to call a given chain. Certainly, it is not efficient to 
report a probability distribution as the knot type classification for an open chain. For 
the sake of definiteness, it is desirable in many situations to choose a classification of 
the knot type of an open chain. In such cases, we typically classify the knot type of 
a given open chain as the knot type that appears most frequently [17, 18, 32, 38]. The 
advantage of assigning the knot type with the highest probability to the open chain 
is that every open chain gets a classification as a knot type. The main disadvantage 
is that sometimes there are two (or more knot types) with similar probability values. 
In such cases, our approximation of the probability distribution might not be precise 
enough to truly classify the predominant knot type within the closures. One could 
sample more directions for these open chains. This adaptive approach has some merit, 
although it is not something that we have implemented. However, most open chains 
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have one knot type that appears more than 50% of the time, and typically even at 
much higher percentages. For example, in a sample of 1000 random equilateral open 
chains with 300 edges, Millett [29] reported that 99.6% of the chains had one knot type 
which appeared with over 50% probability (note that Millett used the original strategy 
of closing to points on a large sphere with random directions for closure, although 
this approach is not likely to alter the statistics considerably). Using the 50% cutoff 
also seems like a reasonable approach to classifying the knot type of an open chain, 
although it leaves the classification of some open chains as being ambiguous (i.e. no 
classification). While we have used the predominant knot type for classification, there 
certainly are contexts where a 50% cutoff might be preferable. 


9.2.2.2 Random arc closure 


Recall that the strategy with the stochastic techniques is to approximate a probabil- 
ity distribution one obtains by closing the configuration in some fashion. Perhaps the 
most natural way to generate such an ensemble is by closing the configuration by 
connecting the endpoints with random arcs. Cantarella et al. [9] have created an algo- 
rithm (and software) for efficiently generating random equilateral polygons and ran- 
dom equilateral arcs with a given distance between the endpoints. The main problem 
with this approach is that one must choose the length of the random chain (i.e. the 
number of edges) with which to close a given open chain. 

Note that in an upcoming sequence of yet untitled papers, the authors of this pa- 
per, in collaboration with Cantarella and Shonkwiler, are analyzing how well some 
of the different strategies presented here predict the knot type of a closed knot given 
some subchain portion of the knot. In such a case, we know precisely how many edges 
have been removed to extract the open chain from the closed knot, so we can replace 
the missing portion of the knot with random arcs with the correct number of edges. For 
arandom open chain (ora protein, for example), it is not so clear how one might deter- 
mine an appropriate number of edges with which to connect the endpoints with ran- 
dom arcs. However, this issue might not be insurmountable. For example, Zirbel and 
Millett [47] determined the average distance between the endpoints of differing length 
subchains for closed polygons. One might use that information to devise a scheme for 
choosing the number of edges in the replacement arcs. 


9.2.3 Other closure techniques 


For the sake of completeness, we mention other techniques (mainly variations of the 
ideas presented above) that have been employed in some papers. In [19], the authors 
attach an “almost complete circle” to the endpoints of an open chain in a random 
way to obtain a closed knot. In [27], the authors connect the endpoints at infinity us- 
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A B C 
Fig. 9.6: The closed chains coming from single closure techniques associated with a subchain of the 
98-edge KnotPlot +75 knot [36]. From left to right, we have (A) direct closure, (B) center of mass, and 
(©) minimal interference. 


es 


Fig. 9.7: The closed chains coming from stochastic closure techniques associated with a subchain of 
the 98-edge KnotPlot +7; knot [36]. For (A) uniform and (B) random arc, we show three examples of 
closures from the technique. 
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ing a pair of directions which is chosen “to minimize the risk of knot modifications or 
disentanglements”. In [20], the authors use a combination of random closures (seem- 
ingly from S* x S$), the center of mass technique, and a deterministic technique. In 
the deterministic approach, they extend interior endpoints to the exterior by adding 
length to the chain as if it is “repelled from the protein surface, until they reach an 
outer sphere”. We guide the reader to these articles for more details. 


9.2.4 Topology of knotted arcs 


Because the standard deformations used to define closed knot equivalence can turn 
any embedded open arc into a straight line segment, different techniques are required 
to study knotted arcs topologically. Classically, this topological description has been 
in the form of the so-called ball-arc pair [8]. The ball-arc pair consists of an open 
(non self-intersecting) chain c and an oriented topological ball B (which need not be 
“round”) with the property that the endpoints of the chain lie on the surface of the ball 
and the interior of the chain lies in the interior of the topological ball. Then two ball- 
arc pairs (B,, c,) and (Bo, cz) are equivalent if there exists an orientation-preserving 
homeomorphism between B, and B2 which takes c, to c2. In [39], the authors use 
the ball-arc pair to define when an open lattice chain forms a “tight knot”, a concept 
which was critical in the proof that long lattice knots are almost always knotted. 

For a given open chain, the assignment of a topological ball (to create a ball-arc 
pair) is equivalent to the assignment of a corresponding set of closures for the chain, 
all of which have the same knot type. To see this, suppose we are given a ball-arc 
pair (B, c). If we join the two endpoints of c via simple arcs lying on the boundary of 
B, we will obtain closed knots with a common knot type. These simple arcs are the 
corresponding set of closures. 

Conversely, the assignment of a closing arc to a given open chain is equivalent 
to the assignment of a corresponding set of topological balls, namely the set of balls 
whose 1) interiors contain the interior of the chain, 2) boundaries contain the end- 
points of the chain, and 3) boundaries contain the closing arc. The ball-arc pairs from 
this set of balls with the given chain are all equivalent as ball-arc pairs. 

For any open arc and any knot type, one can construct a topological ball so that 
the closed knot obtained via a simple arc along the boundary of the ball results in the 
given knot type. Similarly, for any given open arc and any knot type, one can construct 
a closed knot of the given knot type which contains the arc as a subchain. All of the 
techniques presented in this section essentially restrict the set of closures, and thus, 
the set of ball-arc pairs from which one is sampling in a particular fashion. It may 
be possible to define the knotting in open chains by restricting to a given set of balls 
for the ball-arc pair instead of working at the level of closures, although it is not clear 
how one might do so. Still, while much of the modern literature on open knotting does 
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not specifically mention the ball-arc pair, it is important to acknowledge the historical 
topological contributions and note the connections to current work. 


9.3 Visualizing knotting in open chains using the 
knotting fingerprint 


In this section, we introduce the knotting fingerprint as a means of visualizing the 
knotting in open subchains of an open or closed chain. The knotting fingerprint can be 
used with any of the above techniques. The basic idea is to generate an image matrix, 
where each subchain is associated with a unique cell in the matrix, and use colors in 
the cells to communicate the knotting within the subchain. The first such fingerprints 
were seen in [20], in their work to find slipknots within proteins. For open chains, the 
fingerprint takes the form of a triangle, where the position along the x-axis indicates 
the starting vertex index of the given subchain and the position along the y-axis in- 
dicates the ending vertex index. Since the ending vertex index must be greater than 
the starting index, the portion of the matrix above the main diagonal (i.e. the diago- 
nal from upper left to lower right) is empty and the diagonal indicates the behavior 
of zero-length subchains. The entire chain is associated with the sector in the lower 
left corner. The triangular fingerprint also has been used in [18, 38] and seems to be 
accepted as the standard. An example of the image matrix for the DehI protein (PDB 
code 3bjx), the protein forming the most complicated knot yet seen (+61), is shown 
in Figure 9.8 using the uniform closure technique. The knot type of a subchain is the 
knot type appearing with the highest percentage in the closures of the subchain and 
the opacity of the coloring within the cells is that percentage. The ordering of the key 
(from top to bottom) is by the length of subchain at which the knot type first appears. 
We discuss the coloring of the cells more below where we compare the images coming 
from the different approaches. 

To illustrate different ways of visualizing the knotting of subchains, in Figures 
9.9, 9.11, and 9.12, we analyze a relatively nice configuration, the KnotPlot +75 [36], an 
image of which appears in the right picture of Figure 9.9. This knot was chosen because 
the resulting pictures have some, but not too much, complexity. 

For closed chains, using the square image matrix is a little awkward (see the left 
image of Figure 9.9). The problem is that there is a subchain from every vertex to ev- 
ery other vertex, so this is like St x $+ (which, topologically, is a torus) or S* x [0, J] 
(where | is the length, which gives us a cylinder). In [46], the authors adapt the open 
chain approach for closed chains: the x-axis indicates the starting vertex index and 
the y-axis indicates the ending vertex index. As such, the blank part of the matrix 
for open chains gets filled in with subchains that include the vertex with index zero. 
This approach is perfectly reasonable, although the matrices become hard to read due 
to periodic nature of S1 x S1. In particular, just below the diagonal, one has length 
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200 


Fig. 9.8: The image matrix for the Dehl protein (PDB code 3bjx) using the uniform closure technique. 


one subchains, but right above the diagonal, one has the whole chain minus an edge. 
Furthermore, there is a vertical wrapping of the image from bottom to top. One could 
“fix” the diagonal issue by having the x-axis indicate the starting vertex index and 
the y-axis indicate the length of the subchain (see the right image of Figure 9.9). There 
would still be a wrapping from left to right (or one could show the image on a cylinder), 
but this approach would be easier on the eye. This approach has not been employed, 
seemingly due to the desire to be consistent with the original triangular matrix used 
for proteins in [20]. The mathematically correct way to show the image would be on 
a torus or cylinder, although 3D images are not easy to see or manipulate and it is 
not clear that there would by any unique insight from putting the image on a torus or 
cylinder. 

In Figure 9.10, we show the disk matrix [17, 32] for an 81-edge KnotPlot +5> knot. 
For the sake of the disk matrix, the polygon is an oriented based list of n coordinates 
in 3D space, i.e. we specify a first vertex (the base vertex) and a second vertex (which 
uniquely determines the orientation) to the list. The disk matrix is most easily de- 
scribed in terms of polar coordinates: the polar 6 value indicates the midpoint of the 
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Fig. 9.9: Two square image matrices for the 98-edge KnotPlot +75 knot [36] (configuration shown 
on the right image) that communicate the knotting within its subchains using the uniform closures 
technique. On the left, the x-axis indicates the starting vertex index and the y-axis indicates the 
ending vertex index. On the right, the x-axis indicates the starting vertex index and the y-axis indi- 
cates the number of edges in the subchain. 


subchain and the r value indicates the length. The disk matrix has a sort of brick pat- 
tern since open chains with odd numbers of vertices have a vertex midpoint, but open 
chains with even numbers of vertices have a midpoint which is between two vertices. 
Adjacent to the center of the disk are all length one subchains, which are too short to 
be knotted, and so are all colored as the unknot, 0,, in light gray. As we move out from 
the center, we are seeing the knot type of longer and longer subchains, until the last 
stripe which contains subchains with n vertices and n — 1 edges. Let us fix a particu- 
lar vertex v of the configuration. One can add an edge on both sides of v to obtain a 
subchain with two edges, do this again to obtain a subchain with four edges, etc. All 
of these subchains have a common midpoint, namely v. In Figure 9.10, the polar sec- 
tors associated with these subchains have been colored black. The orange and green 
sectors correspond to the subchains that have v as a starting and ending vertex, re- 
spectively. Since we can add one edge on either end as opposed to when we add edges 
at both ends, the orange and green sectors appear at every radius whereas the black 
sectors appear at every other discretized radius. In the next section, we will see the 
orange and green spiral patterns appear frequently. We compute the knot types only 
for the subchains ending at vertex points. One could subdivide the edges to get a finer 
image disk matrix and this would provide more detail. Note that changing the orien- 
tation of the knot would have the effect of flipping the disk matrix across the x-axis 
and changing the identification of the base vertex would have the effect of rotating the 
matrix. 

For the single closure techniques, one has a single “knot type” for any given sub- 
chain. For the stochastic techniques, the “knot type” of a subchain is a probability 
distribution. In [17, 32], and even our work with open chains [18, 38], we assign the 
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Fig. 9.10: The disk matrix for the 81-edge KnotPlot +52 knot [36] using the uniform closures tech- 
nique. On the left, the disk matrix is shown with the configuration superimposed. On the right, the 
black sectors correspond to subchains with a fixed vertex midpoint. The orange and green sectors 
correspond to subchains with the same vertex as the starting and ending vertices, respectively. The 
polar @ value indicates the midpoint of the corresponding subchain and the polar r indicates the 
length of the corresponding subchain. Each sector is colored by the knot type which appears most 
frequently when closed via the uniform closure technique, with the opacity of the coloring set to 
the percentage of closures with that knot type. The key to the right of each disk matrix shows the 
color associated with each knot type that appears in the matrix along with a swatch of that color at 
opacities 10%, 20%, ---, 100%. 


knot type of an open chain to be the knot type appearing most often in the distribution 
of closure knot types. Clearly, we are losing information here, although, as mentioned 
above, for the great majority of open chains there is one knot type that appears in over 
50% of the closures. To give a sense for the dominance of the knot type, we set the 
opacity of the color within each cell to be the percentage of closures that are the given 
knot type (swatches of the colors at opacities 10%, 20%, «++, 100% appear in the key 
to the right of each disk matrix). Thus, light coloring indicates a low percentage and 
dark coloring indicates a high percentage for the dominant knot type seen amongst 
the closures. Like with the open chain triangular/square matrices, the key on the right 
is ordered by the length of the subchain at which the knot type first appears. 
KnotPlot +75 disk matrices for the single and stochastic closure techniques are 
shown in Figures 9.11 and 9.12, respectively. Because this configuration is relatively 
spread out and smooth, we would expect a higher level of agreement between the dif- 
ferent techniques than for less well-behaved configurations. By eye, the images look 
similar in some ways and different in others. The direct closure technique is the out- 
lier here, although it might be more consistent with the other techniques than one 
would expect, especially when compared to the disk matrices for the stochastic tech- 
niques. The disk matrices for the center of mass and minimal interference techniques 
share features and the uniform and random arc closure disks share features, although 
many of the features differ between the two pairs. The single closure techniques tend 
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Fig. 9.11: Disk matrices for the 98-edge KnotPlot +7; knot [36] using single closure techniques: 
direct closure (left), center of mass (middle), and minimal interference (right). 


uniform closures random arc closures 


Fig. 9.12: Disk matrices for the 98-edge KnotPlot +7; knot [36] using stochastic closure techniques: 
uniform closures (left) and random arc closures (right). 


to introduce more subchain knot types than the stochastic techniques, as is seen in 
this example where +62 and +7, appear for the single closure techniques but not 
the stochastic techniques. Furthermore, the single closure techniques tend to have 
sharper features than the stochastic techniques. 


9.4 Features of knotting fingerprints, knotted cores, 
and crossing changes 


In this section, we show some examples of disk matrices, using the uniform closure 
technique, for several random knot configurations. We make some observations about 
the relationship between the configurations and their disk matrices. This section is by 
no means comprehensive, but rather we hope that it whets the reader’s appetite for 
understanding how open knotting manifests itself in closed chains. 
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We begin with a highly localized knot in Figure 9.13. We know that the —3; knot is 
highly localized because the knotting appears near the center of the disk and persists 
at longer lengths (i.e. near the edge of the disk). In general, when the subchains have 
sufficient length to change the knot type, we see cusp-like widow’s peaks appearing 
with a new color. Note that there are some cells on the edge of the disk matrix that 
are gray, indicating that the corresponding 149-edge subchains are unknotted. In this 
case, this behavior occurs because the polygon passes very close to itself at the borders 
of the teal region of the configuration (the minimal knotted region). The removal of 
one edge here is enough to see the subchain as unknotted. If one were to subdivide 
the polygon’s edges, the gray regions would not occur on the border, so this behavior 
is an artifact of the discretization of the disk matrix. 


01 


Fig. 9.13: The disk matrix for a highly localized 150-edge —3, knot, with an image of the configura- 
tion superimposed over the disk matrix. The teal region of the configuration is the shortest sub- 
chain that is knotted. 


In Figure 9.14, we see a disk matrix that has two clear peaks. This behavior is not 
unusual and we often seen disk matrices with multiple widow’s peaks. In this config- 
uration, there are two simple loops (on the left in a gray colored arc and on the right 
in a green colored arc) and a core red arc region. To form the —3;, the subchain must 
pass fully through the red middle region and one of the green or gray regions. The two 
peaks correspond to the red+green and red+gray subchains. So, in general, a peak 
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occurs when the arc has completed some, but not all, of the knotting, and an unknot- 
ted arc wanders away before coming back to complete the knotting. In fact, we see 
the same behavior in Figure 9.13 on the very left portion of the disk matrix. However, 
in Figure 9.13, the knotting is so tight in this area of the polygon that this wandering 
region is very short. 


0, 


Fig. 9.14: The disk matrix for a 50-edge —3, knot with double peaks. 


Typically, we see a lot of different knot types appear in the disk matrix. Some of 
these knot types are intermediate entanglement in subchains needed to create the 
more complicated knotting of the entire knot (which will be discussed in more detail 
below). However, oftentimes the knot types are not “contributing” to the knotting of 
the whole curve. The simplest such example is the case ofa slipknot, i.e. an unknotted 
closed curve that contains knotted subchains. A knotted subchain within an unknot- 
ted closed curve was termed an ephemeral knot by Millett [29]. For the sake of our 
discussion, we consider ephemeral knots to be knotted subchains whose knotting is 
not a key feature to the knotting of the entire closed knot. 

In Figure 9.15, we see an example of a configuration with ephemeral knots [29]. It 
is difficult to see anything within the configuration, but we include a picture for the 
sake of completeness. There are (at least) three green streaks that are ephemeral —3, 
knots. Note that these green streaks can be broken for a number of cells, forming an 
unknot or some other type of knot in the mean time. When the streak is broken, it is 
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unclear whether to mentally join them or to call them new ephemeral knots (although 
we leave that discussion for the next paragraph). In this example, the -3, slipknots 
have only one cell depth (although in Figure 9.16, we see more robustness along these 
lines), so the removal of one vertex on one of the ends is enough to unknot the arc. 
On the other hand, adding vertices to the other end keeps the knotting for a number 
of vertices. 

In addition, there is one very short (lasting only for a single edge) +3, slipknot 
with a black arrow pointing to it in the image. The subchain corresponding to this cell 
is a +3, knot, but it seems safe to assume that the knotting of this subchain is not 
a key feature to the knotting of the base +3; knot (although that could be debated). 
In some cases, like Figure 9.13, it is pretty clear where the core of the knotting is. But 
in many/most cases, finding the core is not so simple. Tubiana et al. [46] defined the 
bottom-up and top-down procedures to define the knotted core (which they called 
the shortest knotted arc). In the bottom-up procedure, one searches for the shortest 
subchain with the same knot type as the base knot. In the top-down procedure, one 
searches for the shortest subchain that has the knot type of the base knot and is a 
subchain of increasing length subchains that also have the same knot type as the 
base knot. In the disk matrix, this is akin to searching from the center outwards (for 
bottom-up) and searching for the widow’s peak closest to the center (for top-down). 
The example of Figure 9.15 shows that the bottom-up procedure might not provide the 
information one expects. On the other hand, thinking of the subchain corresponding 
to the peak closest to the center (the one above and slightly to the right of center), the 
+3, becomes unknotted for some longer chains containing that subchain. One might 
be tempted to forgive the +3, for this slight transgression and declare that subchain 
as the core. Or we might forgive the +3, for having longer subchains that form the 4; 
knot since, in some sense, the 4, is more complicated that the +3. Ultimately, we do 
not have answers to these questions, but rather wish to illuminate the reader about 
some of the issues in defining the knotted core. 

Also note that the configuration from Figure 9.15 has a number of different knot 
types that appear in only a few cells, which is typical for random knot configurations 
using any of the notions of open knotting. 

Ephemeral knotting also can form after the creation ofa given knot. In Figure 9.16, 
we have a —3, knot which adds an ephemeral —3, knot (golden region) to form a com- 
posite knot. The extra —3; persists nearly to the entire length of the configuration. Here 
the ephemeral knotting is more robust than in Figure 9.15 in that there are subchains 
where both endpoints can be removed and the —3,# — 3; knot remains. 

One might label the core as the subchain associated with the green cell closest 
to the center. However, note that there is a streak of gray cells signifying unknotted 
chains. As mentioned above, one might forgive this transgression as it only has depth 
one. Still, if we forgive the depth one transgression, then why not depth two, or three, 
or more? Certainly the composition of -3, with the additional -3; would seem to be 
a forgivable offense, as well as the smattering of -8;9 subchains. 
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Fig. 9.16: The disk matrix for a 150-edge —3, knot with ephemeral knotting in the form of knot com- 
position. 


One way of “adding” ephemeral knotting is in the form of knot composition, as 
is seen in Figure 9.16. However, one can also add knotting by introducing additional 


Open knotting —— 199 


entanglement in the form of a more complicated knot. In Figure 9.17, we have a +3, 
which becomes additionally tangled to form a +55 knot. This additional entanglement 
eventually disappears, leaving only the +3 . In [17, 32], we introduced the idea of sub- 
knotting (here +3; would be considered a subknot of +5,). We might also consider +54 
to be a superknot of +3;. In fact, one might consider creating a family tree of knotting 
using the idea of subknots and superknots. Not surprisingly, the creation of such a 
family tree is not clear cut, although we guide the interested reader to [17, 32] for some 
discussion along these lines. 


01 


= +3; 

- +52 

\ Ge 

| / ini 63 
= -31 

= +7s 
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Fig. 9.17: The disk matrix for a 150-edge +31 knot with ephemeral knotting in the form of more com- 
plicated knotting. 


The idea of a family tree (or, similarly, a graph connecting certain knots) has been 
explored, most notably with regards to crossing changes (see e.g. [10, 11, 13]). In [32], 
we showed that subknotting is similar to, but not the same as, crossing changes per- 
formed on a projection of a configuration. The knot type +5 has unknotting number 
one, meaning that it is possible to perform one crossing change and transition from 
an unknot to a +52 knot, or vice versa. In Figure 9.18, we see a +52 knot that is created 
by transitioning straight from an unknot. The +5> knot also can be created by making 
a+3, knot and then adding complexity, as can be seen in Figure 9.19. 

Figure 9.20 has a number of interesting features. First, we see the creation of the 
+55 via +3, transitioning to +5,, which then transitions to +73 (getting more compli- 
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Fig. 9.18: The disk matrix for a 150-edge +52 knot whose subchains transition straight from an un- 
knot to a +52 knot. 
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Fig. 9.19: The disk matrix for a 150-edge +52 knot whose subchains form +3; knots before transition- 
ing to the +52 knot. 


cated) and then down to the +5. Second, we see an ephemeral 4. This 4; is likely 
not as ephemeral as it might appear since, despite the unknotted padding between the 
regions, the 4, likely creates twisting that is later needed in the transition to the +5). 
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Finally, we see transitions from the unknot straight to +5 , and also passing through 
intermediate +3, and 4, states. 


Fig. 9.20: The disk matrix for a 50-edge +52 knot with a complex transition from unknot to +55. 


9.5 Conclusions 


We have presented the definitions of the commonly used notions of open knotting, 
including some discussion of desirable and undesirable properties of each. We intro- 
duced different matrix representations of the knotting within subchains of open and 
closed chains. Finally, we showed some examples of disk matrices for random knots 
and highlighted the relationships between some features of the disk matrices and the 
configurations. 

We believe that the uniform closure method provides the most desirable prop- 
erties for analyzing knotting in open chains. The main disadvantage is computation 
speed, and, indeed, with respect to speed, uniform closure is at a significant disad- 
vantage to the single closure techniques. Furthermore, we believe that the disk matrix 
provides an intuitive means for the viewer to glean information about the knotting 
within a closed chain. 
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While researchers might not agree on which notion of open knotting to use, we 
hope the discussion here will inform future decisions. 
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Abstract: It is well known that knots exist in natural systems. For example, in the case 
of (mutant) bacteriophage P4, DNA molecules packed inside the bacteriophage head 
are considered to be circular since the two sticky ends of the DNA are close to each 
other. The DNAs extracted from the capsid, without separating the two ends, can pre- 
serve the topology of the (circular) DNAs, and hence are well-defined knots. Further- 
more, knots formed within such systems are often varied and different knots occur 
with different probabilities. Such information can be important in biology. Mathemat- 
ically, we may view (and model) such a biological system as (by) a random knot space 
and attempt to obtain information about the system via mathematical analysis and 
numerical simulation. The question here is to find the probability that a randomly 
(and uniformly) chosen knot from this space is of a particular knot type. This is equiv- 
alent to finding the distribution of all knot types within this random knot space (called 
the knot spectrum in an earlier paper by the authors). In this paper, we examine the 
behavior of the knot spectrums for knots up to 10 crossings. Using random polygons 
of various lengths under different confinement conditions as the random knot spaces 
(model biological systems), we demonstrate that the relative spectrums of the knots, 
when divided into groups by their crossing numbers, remain surprisingly robust as 
these knot spaces vary. For a given knot type K, we let Px(L, R) be the probability 
that an equilateral random polygon of length L in a confinement sphere of radius R 
has knot type K. We give a model for the family of functions P(L, R) and show that 
our model function fits the random polygon data we generated. For a fixed crossing 
number Cr, 3 < Cr < 10, let Sc, be the subspace consisting of random polygons which 
form knots that have crossing number Cr. We study the relative distribution of all the 
different knot types within S;, and illustrate how this distribution changes if we keep 
the length L fixed (or the confinement radius R fixed) and vary the confinement radius 
R (or the length L). We observe that this distribution is quite robust and remains es- 
sentially unchanged under length and confinement radius variation, especially if one 
concentrates on subfamilies such as alternating prime knots, non-alternating prime 
knots, or composite knots. 
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10.1 Introduction 


In the world of pure mathematics, or more specifically knot theory, one typically con- 
siders the space of knots as the entire collection of all knots, with each distinct knot 
type as an element of this discrete space. It is commonly agreed that some knots are in 
general “more complicated” than other knots, although there are many different ways 
to define knot complexity and sometimes a knot more complicated with one complex- 
ity measure may be simpler in terms of a different complexity measure. For example, 
some knots with high crossing numbers have small bridge numbers and some knots 
with relatively small crossing numbers have high bridge numbers. One could imagine 
that in an ideal knot space, each knot type is no more probable than any other knot 
type. However, this is not the case for knots occurring in the world of nature. For ex- 
ample, if one is to tie a knot with a rope of some fixed length, then one can only tie 
finitely many different types of knots ([4, 10, 11, 25]). Furthermore, if one is to tie a 
knot with a rope in a random fashion, then one will inevitably reach the conclusion 
that some knots are “easier” to tie than other knots (meaning some knots will be tied 
more often than other knots if the experiment is repeated). Some important biologi- 
cal problems concerning subjects such as DNA packing are also related to this topic. 
In DNA research, a relatively simple virus called bacteriophage is commonly used to 
study the DNA packing mechanism. This virus keeps its genome in a spherical protein 
container called a capsid. Although the packing of the DNA inside the capsids cannot 
be directly observed, DNA extracted from the capsids without being broken retains its 
topological information and this information is used as a probe in studying the DNA 
packing mechanism [2]. It is reported that many different DNA knots form within the 
capsids of bacteriophages, with certain knot types appearing with much higher fre- 
quencies [2]. 


Mathematically, a biological system such as the DNA knots within the bacterio- 
phage head is really just a “knot space” whose elements are geometric closed curves 
with a certain (continuous) probability distribution. If one is to randomly and uni- 
formly sample a knot from this space, the probability that one gets a particular knot 
type is the same as the percentage of knots of this knot type within the space. The 
distribution of the knot types within the space is called the knot spectrum in [12, 16]. 
In a sense, the knot spectrum (of a knot space) measures how easily a knot type can 
be realized by a randomly chosen knot in that space and provides a knot complexity 
measure. While it is quite conceivable that specific knot spaces can be constructed to 
favor a particular type of knots, we are interested in the following question: Are there 
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knots that are intrinsically easier to form in most random knot spaces without obvious 
topological biases? For example, in the space of random equilateral polygons with a 
fixed length, it is known that composite knots have higher frequencies than the prime 
knots when the polygons are long (in fact the probability of getting a prime knot goes 
to zero as the length of the polygon goes to infinity [9]). 


In this paper, we study the above problem by examining the behavior of the knot 
spectrum for knots up to 10 crossings. Using random polygons of various lengths un- 
der different confinement conditions as the random knot spaces (model biological 
systems), we demonstrate that the relative spectrums of the knots, when divided into 
groups by their crossing numbers, remain surprisingly robust as these knot spaces 
vary. For a given knot type K, we let Px(L, R) be the probability that an equilateral 
random polygon of length L in a confinement sphere of radius R has knot type K. We 
give a model for the family of functions P3.(Z, R) and show that our model function fits 
data we generated from a large sample of random polygons. Furthermore, for a fixed 
crossing number Cr (3 < Cr < 10) we consider the knots space Sc; of random poly- 
gons that form knots with Cr crossings. We observe how the frequency distribution of 
knots in Sc, changes under various lengths and confinement conditions. One of the 
main findings in this article is that this distribution is quite robust and remains es- 
sentially unchanged under length and confinement radius variation, especially if one 
concentrates on subfamilies such as alternating prime knots, non-alternating prime 
knots, or composite knots. The support of the previous statement is given by the data 
we collected. The details of the data collection will be explained in Section 10.3. Even 
though our data set is quite large, it does not allow us to support any statements on the 
individual knot type distributions for knots with more than 10 crossings since there are 
very few samples per knot type for crossing numbers above 10. Furthermore, already 
for nine and 10 crossings, the natural sampling error is large enough to only weakly 
support any conjecture about the independence of knot type distributions on polygon 
length and confinement radius. As a final remark, we need to acknowledge that we do 
not see any way to prove any statement on knot distributions. At this point we cannot 
even prove much simpler statements like: The probability that a random polygon is 
knotted is larger if the polygon is confined when compared to an unconfined random 
polygon of the same length. 

This article is organized as follows: In Section 10.2 we introduce some basic ter- 
minology in knot theory. In Section 10.3 we outline the size of our random polygon 
sample and how the knot types in our sample were determined. In Sections 10.4 and 
10.5 we explain the choice of a model function for Px(L, R) and show how well our 
data fits the model. Next, in Sections 10.6 and 10.7 we study how the confinement ra- 
dius R and the polygon length L affect the distribution of the different knot types in 
Scr. Finally, in Section 10.8 we summarize our findings. 
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10.2 Basic mathematical background in knot theory 


For the convenience of our reader, we outline and discuss briefly a few topological 
concepts that are most relevant to this paper. For a more detailed exposition, please 
refer to a standard text on knot theory such as [1, 6, 21, 26]. 


A knot K is a simple closed curve in R?. Here we assume that such a curve is a 
piece-wise smooth curve (this includes a space polygon without self-intersections). 
Two knots are considered topologically equivalent if one can be continuously de- 
formed, together with the entire R? space surrounding it and without being broken 
or causing self-intersection in the process, to the other. The class of all knots equiva- 
lent to a knot is called a knot type. The knot type that contains the unit circle is called 
the trivial knot (type). For a fixed knot K, a regular projection of K is a projection of 
K onto a plane such that no more than two segments of K cross at the same point 
in the projection. At each intersection point of a regular projection of K, it is usually 
marked which strand is over and which strand is under. These intersection points are 
called crossings. A regular projection with this over/under information marked at the 
crossings is also called a knot diagram. The minimum number of crossings among all 
possible knot diagrams of knots with the same knot type as K is called the crossing 
number of K (and is usually denoted by Cr(K)). A knot diagram is alternating if at the 
crossings under and over alternate as one travels along the knot projection. A knot 
type is alternating if it has an alternating diagram and is non-alternating if it does not 
have any alternating diagram. A knot is called a composite knot if it is realized by con- 
necting two nontrivial knots as shown in Figure 10.1. Ifa knot is not a composite knot, 
then it is a prime knot. 
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Fig. 10.1: A composite knot from two non-trivial knots K; and K2. 


One fundamental problem in knot theory is to determine the knot type of a given 
knot. The most common and powerful tools for this purpose are various knot poly- 
nomials. The well-known knot polynomials include the Alexander polynomial, the 
Jones polynomial, and the HOMFLYPT polynomial. These polynomials can be com- 
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puted from any knot diagram and remain unchanged when computed from a different 
knot diagram of the same knot type. In this paper, the authors rely on the HOMFLYPT 
polynomial and the knot table for knots up to 16 crossings for knot identification. For 
the definition of the HOMFLYPT polynomial and its properties, one may refer to [24]. It 
is important to note that the knot polynomials are not sufficient to distinguish all knot 
types and the tabulation of knots remains a difficult question in general. Furthermore, 
only prime knots up to 16 crossings have been completely tabulated [20]. 


10.3 Spaces of random knots, knot sampling and 
knot identification 


Ina series of papers, three authors of this paper have developed algorithms for several 
models to generate equilateral random polygons that are confined inside a sphere of 
fixed radius [13, 14, 15, 17]. The model presented in [14] is the one chosen for the study 
presented in this paper. The model can be described as follows: Consider equilateral 
random polygons that are “rooted” at the origin and assume that there is an algo- 
rithm that samples such objects with uniform probability. Now consider a confine- 
ment sphere Sp of radius R = 1 with its center at the origin. We keep those randomly 
generated equilateral polygons that are contained in the confinement sphere Sp. Note 
that the algorithm used in [14] to generate polygons in confinement is not based on 
a direct accept-reject method (since such a method is extremely inefficient). Instead, 
it uses conditional probability density functions that can be explicitly formulated to 
guide the generation process. Each polygon is generated one edge at a time and there 
is no rejection involved. Furthermore, this algorithm generates polygons that are to- 
tally independent of each other so no de-correlation is necessary. Interested readers 
please refer to [14] for a detailed description of this algorithm. There is no biological 
or other reason for the polygons to be rooted at the center. It is rather a choice for sim- 
plicity: as it turns out, equilateral random polygons defined this way are much easier 
to generate due to the symmetry of the confining sphere (relative to the root) imposed 
on the equilateral random polygons. In order to obtain statistically significant results, 
it is necessary to use large samples and the simplicity of this model allows the compu- 
tations to be feasible. We want to point out to the reader that, recently, a different and 
very efficient method to generate such confined polygons (also centered at the origin) 
was developed, see [7]. 


Our knot space consists of all random equilateral polygons of a given length and 
a given radius of confinement. By varying these two parameters, we obtain a family 
of knot spaces. Notice that the differences among these knot spaces are imposed by 
different geometric conditions and no explicit topological constraints exist. The data 
set used for our analysis in this paper comes from two sources. 
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The larger portion of the data is new and we describe how it is obtained here. 
To trace the effect of polygon length on knotting, we fixed the confinement radius at 
R = 3 and used a range of lengths of the random polygons from 10 to 90 in incre- 
ments of 10. We used a maximal polygon length of 90 steps and a confinement radius 
R = 3 to ensure that, even for the longest sample, we can still identify the knot type 
for most polygons. This means that many of the knots that are generated must have at 
most 16 crossings so they are in the current knot table. From our past experience [12], 
we know that if we choose a confinement radius that is smaller than R = 3, then we 
would not be able to identify the knot types for the longer lengths. To trace the effect of 
the confinement radius, we fixed the polygon length at 30 steps and used radii of con- 
finement ranging from R = 1 to R = 4.5. In more detail, between R = 1 and R = 3 the 
confinement radius increases with an increment of 1/10, while from R = 3 to R = 4.5 
the confinement radius increases with an increment of 1/2. The reason for the wider 
spacing of the larger radii is that the confinement effect diminishes rather quickly for 
the polygons within the length-range of this study. The polygon length of 30 is cho- 
sen so that sufficiently many different knot types populate our knot spaces and that 
we are able to identify the knot type for most polygons. For each knot space (with a 
fixed confinement radius and a given polygon length), we sampled 100, 000 differ- 
ent polygons. In total, this sample space consists of 32 sets, each containing 100, 000 
polygons (24 samples of length 30 with varying confinement and 9 samples of vary- 
ing length with confinement R = 3), yielding a total of 3, 200, 000 random polygons 
(since one sample is in both sets). 

A smaller portion of the data (1, 640, 000 polygons) is from one of our earlier 
papers [12]. We basically took the same set of radii (24 values) as described above and 
the same set of different length (9 values). For each combination of a radius and length 
10, 000 polygons where collected. However, for some of the larger lengths and smaller 
confinement radii, we could not identify the knot types reliably. The knots are too 
complex for our knot identification process to work — see the description below. Thus, 
some of these data sets were not used. We refer the reader to [12] for information about 
the exact description of the data. It is enough to know that this data set is grouped into 
subsets of size 10, 000 for each fixed (L, R) pair, where 1 < R< 4.5 and10<L< 90. 
Moreover, data points that are collected from this older data set have a larger error 
margin since the sample size is much smaller (10, 000 versus 100, 000 polygons). 


As a final remark, our combined data set contains almost five million polygons. In 
the following sections we will refer to the two data sets as the old data (the 1, 640, 000 
polygons) and the new data (the 3, 200, 000 polygons). 


In the following we briefly outline our procedure of knot identification for a given 
polygon P. The polygon P is projected onto a plane to obtain a knot diagram. Then un- 
raveller [28] is used to (potentially) simplify the crossing information via a collection of 
simplification operations based on Reidemeister moves. The code unraveller produces 
two types of output: a DT-code which is used by knotfind [19] and crossing information 


Confined knots —— 211 


which is used to compute the HOMFLYPT polynomial H using a program written by 
Ewing and Millett [18]. We have a table of HOMFLYPT polynomials for all chiral knot 
types, prime and composite, with 16 or fewer crossings (under the generally accepted 
assumption that the crossing number of composite knots is additive). For each P, we 
obtain a list of chiral knot types {Ki,..., Ks} from the table, all of which have H as 
their HOMFLYPT polynomial. Note that if this list is empty then the polygon does not 
represent any prime or composite knot with fewer than 17 crossings. In addition to the 
HOMFLYPT polynomial calculation, for each polygon we also use knotfind to compute 
the (non-chiral) knot type. We use the simplified DT-code D generated by unraveller, 
which might be the DT-code of a minimal diagram or might be the DT-code of a dia- 
gram close to the minimal diagram. Then knotfind takes D and creates a “canonical” 
DT-code D’. If D’ represents a knot within the DT-code knot table (which only con- 
tains prime knot types) then that uniquely identifies that knot (up to chirality). If both 
methods agree then we claim that the knot type of P is identified. If the two methods 
disagree, that is one method produces a knot that we should be able to identify, but 
we cannot confirm this with the other method, then we resolve this issue as follows: 
Either we start over using a different projection of the polygon P or we run another 
diagram simplification program written by some of the authors. In this way, all such 
conflicts were resolved. Thus, we claim that with very high probability we identified 
all knots that are in the current knot table [16]. 


However there are still several additional issues which we address in the following 
paragraphs. 


(i) The polygon P might represent a knot that is not prime and therefore is not in 
the DT-code knot table. At every stage of the simplification process using DT-codes, 
knotfind attempts to identify factors of composite knots. If there is an “obvious” con- 
nected sum then a part of the DT-code maps onto itself. If such a situation is detected, 
the simplification process is applied separately to each DT-code of the two factors. In 
the end, we obtain a collection of simplified DT-codes from which it is possible to re- 
construct the original composite knot type. If the HOMFLYPT polynomial of this recon- 
struction agrees with the originally computed HOMFLYPT polynomial of the polygon 
then we identify P as the appropriate composite knot. 


(ii) If the initial calculation of H indicates that P does not represent a knot in the 
table, then the simplified DT-code (or the set of simplified DT-codes in the case that 
an obvious connected sum was identified) provides an upper bound on the crossing 
number. In these cases, we double-checked by computing the HOMFLYPT polynomial 
from the simplified DT-code(s) to ensure that it matches H. We identify P as having the 
crossing number provided by the simplified DT-code. 


We observed that the approach to knot simplification based on DT-codes is ex- 
tremely reliable for the knots that are within the knot table. There is no reason to be- 
lieve that the simplification of DI-codes becomes suddenly unreliable once the actual 
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crossing number exceeds 16. Thus, we believe that a simplified DT-code, while tech- 
nically only providing an upper bound on the actual crossing number, gives a value 
that, in most cases, is actually the topological minimal crossing number. In particular, 
this is true for crossing numbers that are not far above 16 (the largest crossing number 
in the knot table). Therefore we report these approximated crossing numbers as if they 
are the actual crossing numbers. We also report a knot as prime if no composition was 
detected during the simplification process. 


10.4 An analysis of the behavior of P. with respect 
to length and radius 


Let Px(Z, R) denote the probability that a randomly selected polygon in the set of 
equilateral random polygons of length L in a confinement sphere of a fixed radius R 
has knot type X. In this section, we analyze the dependence of Px(L, R) on the poly- 
gon length L (for fixed R values) and on the confinement radius R (for fixed polygon 
lengths L). At the end of the section we view Px(L, R) as a function of both L and R. 


10.4.1 P,(L, R) as a function of length L for fixed R 


For each knot type K, it is obvious that there is a minimal length Lo such that K can 
be realized by an equilateral polygon of length Lo but not by any equilateral polygon 
of length less than Lo (Lo is called the equilateral stick number of the knot type K 
[22, 27]). Thus, Px(L, R) = 0 for L < Lo. For all the data analyzed in this study, we have 
2R < L which means that the polygons were generated under confinement pressure 
(though very light pressure for the data set for L = 10 and R = 4.5). Our numeri- 
cal results strongly support the following conjecture: for any fixed R, as L increases, 
Px(L, R) increases to a single maximum at length Lx, and then declines to zero as L 
approaches infinity, see the left of Figure 10.2. Notice that a similar phenomenon has 
been observed in [8] in the absence of confinement. We further conjecture that the 
rate of the decline is exponential in terms of some positive power of L. This conjecture 
has been proven when the polygons are not confined [9]. In that proof [9], it is shown 
that as L increases, the number of connected sum components in the equilateral ran- 
dom polygon increases with a probability that goes to one, hence the probability for 
the polygon to be of a given knot type goes to zero. Unfortunately, this argument may 
not be applicable to confined knots since a long confined random equilateral polygon 
may not have many connected sum components. However, it is quite plausible and 
intuitive that the overall knot complexity of a confined polygon increases as its length 
increases and thus knots with a lower complexity become less likely. This is strongly 
supported by our numerical results. 
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Fig. 10.2: On the left: A hypothetical curve Pc(L, R) with fixed R. On the right: A hypothetical curve 
Px(L, R) with fixed L. 


10.4.2 Px(L, R) as a function of confinement radius R for fixed L 


Here we assume that the fixed L value is large enough so that a knot of type K can 
be easily formed. Since we can only model confinement radii R = 1, we assume that 
Px(L, 1) > 0 is a positive number. We can only speculate on the behavior of Px(L, R) 
for R < 1. Clearly no knots can exist for R < 1/2, so Px(L, R) = O for R < 1/2. However, 
the behavior of Px-(L, R) for 1/2 < R < 1 could be different for different knot types. 
Extreme confinement may favor certain knot types, so it is conceivable that, for some 
knot type K, we have Px(L, R) = 0 for all R < Ro for some value Ro > 1/2. Moreover 
the value Ro could depend on the knot type and have different values for different 
knot types. Since we have no data for R < 1, we only stipulate that Px(L,1) > 0 
is a positive number in our model. Just as in the case of dependence on length, we 
propose that with increasing radius P4,(L, R) rises toa single maximum at radius Rx, 
and then declines. However, this time Px(L, R) becomes a positive constant U;(K) 
once R becomes large enough (say R > L/2). This is because, for large R, the effect 
of confinement disappears and P.(L, R) is simply the probability for an unconfined 
equilateral random polygon of length L to be of knot type X. By our initial assumption 
on L, this probability is positive and we denote it with U;(X). 


10.4.3 Modeling Px as a function of length and radius. 


Based on known rigorous results and our numerical results, we propose the following 
model function for Px (L, R): 


7 L=1g00\* a: 
Px(LsR)=a(a+ (Ae A0) P|“ pR=e }* 
where Lo(X) is the equilateral stick number of K, L > Lo(K), R>1, R > c/b, anda, b, 


c, d, e > O are positive constants depending on the knot type K. The condition R > c/b 
insures that the exponential term decreases with L. In addition we set Px(L, R) = 0 
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for L < Lo(X). Notice that for a given fixed R value, P3(L, R) is a function of L with 
the following properties: 

1. limz_,.. Px (LZ, R) = 0; and 

2. P«(L, R) has a single maximum point. 


On the other hand, for a fixed L value, Px (L, R) is a function of R with the properties: 


1. limp_,.. Px(L, R) = ad > 0; and 
2. Ps(L, R) has a single maximum point. 


10.5 Numerical results 


10.5.1 The numerical analysis of P(L, R) based on the old data 


We fitted the above model function P3.(L, R) to knot data of several knots up to seven 
crossings. Here we used the older data set, where each data point is based on a sample 
of 10, 000 polygons. Three examples are shown in Table 10.1 with the corresponding 
fitting parameters and R? values: 


Tab. 10.1: The parameters for the model P(L, R) for various knot types 


parameter 31 5 62 
a 0.000205 2.17277x10-’ 1.73334 x 107? 
b 5.95573 4.28009 3.85288 
c 2.2024 1.3623 1.31161 
d 124.299 9736.84 335759. 
e 3.43966 5.66256 7.09841 
R2 .996 .986 982 


Figure 10.3 is a plot of the fitting function P3,(L, R) together with the actual data 
points from two different view points. Here Lo(31) = 6. The maximal z-value of the data 
points is = 0.21 and the average value is = 0.108. If we compute the absolute value of 
the difference between the z-values of the data points and the best fit function, we get 
a maximal value of ~ 0.026 and an average value of = 0.0067. 


In Figure 10.4 we show the fitting functions for Ps,(L, R) and P¢,(L, R) together 
with the actual data points. Note that for both of these knots Lo(52) = Lo(62) = 8 
[27]. The maximal z-value of the data points for 55 (62) is = 0.059 (= 0.027) and the 
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Fig. 10.3: Two different views of the surface P3,(L, R) together with the actual data points. The 
slightly different coloring of the data points indicates if the points are above or below the surface. 
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Fig. 10.4: Left: The plot of the fitting function Ps,(L, R) together with the actual data points; Right: 
The plot of the fitting function P¢,(L, R) together with the actual data points. The slightly different 
coloring of the data points indicates if the points are above or below the surface. 
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average value is ~ 0.022 (= 0.0088). The maximum difference between the z-values 
of the data points for 5> (62) and the best fit function is ~ 0.011 (= 0.0055) while the 
average difference is ~ 0.0026 (= 0.0011). 
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Fig. 10.5: Comparison of the fitting function P4(L, R) obtained from the newer data and the older 
data for the knots 34, 41, 51, and 52. Left: Fixed R; Right: fixed L. The solid curves are based obtained 
by connecting the new data points by line segments. The circles are the old data and the dashed 
lines represent a cross section through the surface P(L, R) fitted to the old data. 


10.5.2 The numerical analysis of Px(L, R) based on the new data 


Overall, the newer and larger data sets did not produce any surprises in terms of the be- 
haviors of the Px(L, R) functions, in fact there is an amazingly good agreement. Here 
we show two examples. In Figure 10.5 on the left we compare the new data points with 
the old data points and with the slices for R = 3 from the surfaces P«(L, R) obtained 
from the old data. The figure shows the data for the first four non-trivial knot types in 
the knot table. Here each round data point is based on the old data set (from a sam- 
ple of 10, 000 polygons), whereas the solid lines are connecting the data points from 
the new data (from a sample of 100, 000 polygons). Each of the two dashed curves 
shows the slice for R = 3 of Px(L, R) when fitted to the 164 data points of the old 
data for knot types 3; and 4;. Similarly, Figure 10.5 on the right shows a comparison 
for L = 30 based on the new data (solid curves) with their counterparts based on the 
old data (circles) and of two fitting functions P4(L, R) (dashed curves) based on the 
old data. The R? values for the fitting functions based on the new data points are all 
20.999. 


10.5.3 The location of local maxima of Px(L, R) 


The maximal points of P4(L, R) for a fixed knot type X lie on a line in the (L, R) plane 
as shown in Figure 10.6 on the left using the example of the four knots 31, 41, 52, and 
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62. The points shown in the figure are computed from the model Px(L, R) obtained 
from fitting the older data. The slope of these lines increases with knot complexity 
from about ~ 20.41 for the trefoil knot to about ~ 27.35 for the knot 6. Furthermore, 
for a fixed radius R, it takes more and more length for the maximum to occur as the 
knot complexity increases. 
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Fig. 10.6: The maxima of the fitted surfaces P(L, R) from the older data for the knots 31, 41, 52, 
and 62. Left: Dependence of the length on where the maxima occur on R. Right: Dependence of the 
magnitude of the maxima on R. 


In Figure 10.6 on the right, we show the relationship of the maximal value of 
P(L, R) and the (L, R) plane using the same four examples of 31, 41, 52, and 69. 
Here we see that the height of the maxima rapidly declines as the knot complexity 
increases. It appears that as the radius (and length) increases, the maximal value of 
Px(L, R) initially drops slightly and then remains roughly constant. In Figure 10.7 we 
show each of these curves in a coordinate system by itself and now it appears that the 
value of each maximum of P4,(L, R) slightly increases for increasing R after the initial 
drop. 


10.6 The influence of the confinement radius on the 
distributions of knot types 


In this section we consider only the new data, and we would like the reader to recall 
that these are all polygons of length 30. A recurring observation made in this section 
is that the relative distribution of knots in a given group of knots (e.g the group of 
7-crossing alternating knots, or the 8-crossing composite knots) is remarkably steady 
for many different confinement radii. The last few subsections in this section illustrate 
this visually. 

Many of the figures presented in Sections 10.6 and 10.7 contain error bars to in- 
dicate the reliability of the data. Usually the error bars are only added to some of the 
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Fig. 10.7: The plot of the heights of the maxima of the fitting function Px(L, R) depending on R for 
the knots 3; (top left), 4, (top right), 52 (bottom left) and 62 (bottom right). 


curves of each graph so that the graph remains intelligible. The error bars were deter- 
mined using the standard interval estimation for binomial proportions (see [3]). The 
standard confidence interval is computed as CIs = p + Za). n-/? (6G)'/? with p = * 
where n is the sample size, X is the number of successes (occurrences of knots of in- 
terest) in the sample and q = 1 - p. The value Z,/. is the 100(1 — a/2)-th percentile of 
the standard normal distribution. In this article we use a = 0.05. 


10.6.1 3-, 4-, and 5-crossing knots 


Our data contains 501,838 knots with three, four, or five crossings. There are 
322, 609 3, knots, 85, 134 4, knots, 35, 184 5, knots, and 58, 911 5> knots. There is 
a maximum of 21, 291 knots at radius R = 1.0 and a minimum of 6, 951 knots at ra- 
dius R = 4.5. We note that, for all radii, the order of the knot types (from least frequent 
to most frequent) is consistent, as shown in Figure 10.5 on the right. 


10.6.2 6-crossing knots 


Our data contains 69, 080 6-crossing knots (all polygons are of length 30). There are 
17, 940 6, knots, 21,077 62 knots, 12, 620 63 knots, and 17, 443 3,#3, composite 
knots. There is a maximum of 7, 065 knots at radius R = 1.0 anda minimum of 205 
knots at radius R = 4.5. We note that we do not distinguish the square and the granny 
knot — instead the two different composite knots are grouped together as 3,#3;. The 
four functions P4(30, R) are shown in Figure 10.8 on the left. The figure shows that 
the order of knot types is not the same for all radii, unlike for the smaller knot types 
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shown in Figure 10.5 on the right. The order of the four knot types seems to be the 
same for the smaller radii 1 < R < 1.5, while it is hard to see what happens for the 
larger radii. In order to make this more visible, we show the relative percentage of the 
four different knot types, see Figure 10.8 on the right. Here, for each fixed R-value, 
the y-values of the four data points add up to one. This kind of normalization is done 
frequently in the following sections of this article. In order to make the differences of 
probabilities between different knot types more visible for a particular family of knots 
(for example knots with a fixed crossing number or knots with a particular property 
such as 9-crossing composite knots) we scale the actual percentages such that the sum 
of the scaled percentages adds up to one for all knots in this family. This new quantity 
is called relative percentage and in Figure 10.8 on the right we see this for the first 
time. Most noticeable in this figure is the relative decline of the composite knots as 
the confinement radius decreases. This is not really surprising as it has been shown 
in [12, 16] that tighter confinement suppresses composite knots. We also observe that 
the order of the alternating knots (from least frequent to most frequent) is consistent, 
except when the error bars become too large, and that the knot 63 occurs with the 
smallest frequency. This is due to the fact that 63 is achiral and achiral knots have 
been observed with a lower frequency than chiral knots of the same crossing number 
{12]. 
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Fig. 10.8: On the left: The percentage (or actual probability) of the four different 6-crossing knots. 
On the right: the relative percentage distribution of the four knot types. For a fixed R, the frequen- 
cies of all data points for that R add up to one. 


10.6.3 7-crossing knots 


Our data contains 40, 071 7-crossing knots. There are 2, 587 7, knots, 5, 125 72 knots, 
4, 763 73 knots, 2, 835 7, knots, 6, 497 7, knots, 7,665 7. knots, 3, 717 77 knots, 
and 6, 882 3,#4, composite knots. There is a maximum of 5, 775 knots at radius 
R = 1.1 (there are 5,745 at radius R = 1.0) and a minimum of 47 knots at radius 
R = 4.5. Since the values of Px(30, R) are quite small, we only display the relative 
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percentages of all 7-crossing knots in Figure 10.9. As before, we observe that the rel- 
ative percentage of the composite knot 3,;#4, declines with decreasing confinement 
radius. In addition, for radii R < 2, the order of the relative percentage of the 7-crossing 
prime knots seems to be independent of the radius. The knot 7¢ is the most frequent, 
followed by 75, 72 or 73, 77, and 7, or 7,1. At this point, we have no explanation of 
why the data shows this ordering of the knot types. We have compared this order with 
the order of knot energies [23] and ropelength [5], however we found no correlation at 
all (see Table 10.2). 


Tab. 10.2: The order of 7-crossing knots based on various measures. 


knot P(30,R) ropelength Mébius energy approximation 


71 7 1 1 
72 30r4 3 2 
73 30r4 2 3 
74 5or6é 4 4 
75 2 5 5 
76 1 7 6 
77 5or6é 6 7 
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Fig. 10.9: The relative percentage of the different 7-crossing knot types on the vertical axis as a 
function of the confinement radius R. 


10.6.4 8-crossing knots 


In the following, we investigate the knot type distribution for 8-crossing knots. Our 
sample contains 42, 711 knots with eight crossings, with a maximum of 7, 301 knots 
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at radius R = 1 and a minimum of 38 knots at radius R = 4.5. As the number of knot 
types increases, the graphs become more and more complicated. There are 21 prime 
knot types and 3 composite knot types to consider. A figure similar to Figure 10.9 would 
be too crowded and so we display the information in a different way. 

First, we claim that the knot type distributions are remarkably consistent regard- 
less of the radius of confinement. In other words, while the exact value of the rela- 
tive percentage is not the same, the order of the knot types (from smallest to largest 
relative percentage) is often the same. To support this, we no longer show a curve 
for each knot type versus the radius since this would be 24 curves (21 prime and 3 
composite). Instead we combine the knots of 4 consecutive radii and show one curve 
for each group of radii reducing the number of curves to plot to six. That is we com- 
bine R = {1.0,1.1,1.2, 1.3}, R = {1.4,1.5,1.6,1.7}, R = {1.8,1.9, 2.0, 2.1}, 
R = {2.2, 2.3, 2.4, 2.5}, R = {2.6, 2.7, 2.8, 2.9}, and R = {3.0, 3.5, 4.0, 4.5}. For 
each of the groups of four radii, we tabulate the distribution of each of the 24 knot 
types by using relative percentages as before. Thus for each of the six curves the total 
sum of the values over all 8 crossing knot types equals to one. In effect, in Figures 10.10, 
10.11, and 10.13 the information plotted vertically for each knot type is a compressed 
version of what Figure 10.9 shows (for 7-crossing knots) with each curve. Finally, we 
split the data into two groups, and display the data for the knot types on the x-axis, 
see Figures 10.10 and 10.11. 
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Fig. 10.10: The relative percentages of the different 8-crossing knots from 8; to 813 grouped into six 
groups by the different radii. In each group, the smallest radius is shown as a plot label. 


The most striking feature of Figures 10.10 and 10.11 is how much more likely the 
non-alternating knot types 819, 820, and 82; are when compared to all other knot types 
(prime and composite). This property is independent of the radius of confinement. 
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Fig. 10.11: The relative percentages of the different 8-crossing knots from 813 to 41#4; grouped into 
six groups by the different radii. In each group, the smallest radius is shown as a plot label. 


However, the larger confinement radii favor non-alternating knots even more than the 
smaller confinement radii. If one looks closely at Figure 10.11 then one can see that for 
the non-alternating knots 819, 829, and 82, the order of the curves (from the small- 
est relative percentage to the largest relative percentage) is by the size of the radius. 
That is, the smallest group of confinement radii leads to the least relatively frequent 
occurrence of the knot type and the largest group of confinement radii leads to the 
most relative frequent occurrences of the knot types. For prime knots, the order of the 
curves is often the opposite, the curve representing the smallest group of confinement 
radii is on the top for many knots and the curve with the largest group of confinement 
radii is on the bottom. We suspect that this inconsistency in the order of curves maybe 
due the fact that our sample is not large enough, so that for knot types that have a 
much smaller frequency, the sample error could be large enough to create a different 
order. For the three composite knots 3,#5,, 3,#52, and 4,#4, in Figure 10.11, we no- 
tice that the curve for the largest radii is on the bottom (smallest relative percentage). 
However, the order of the other curves is not clear and the data points are very close 
to each other. 

The least likely knot type is 8:3 whose standard diagram is based on the basic 
Conway polyhedron 8", which has already been observed in [12]. More precisely, the 
knot 81g appears only 49 times in our sample, while the next lowest occurrence is an 
order of magnitude larger (the knot 83 with 557 cases). The knot that occurs the most 
is 829 with 6, 964 cases. In order to get more insight, we look at some subsets of the 
knot types. Figure 10.12 on the left shows that 3,#5> is more likely than 3,#5,, which 
in turn is more likely than 4,#4,. The curve for 3,#5, has statistical error bars which 
are similar to those of the other two curves. This distribution of the composite types 
is expected since 53 is more likely than 5, and the composite 4,#4, is achiral, and 
therefore less likely. The figures hint that this distribution is largely independent of 
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the confinement radius. Figure 10.12 on the right shows the frequencies of the three 
non-alternating knot types 819, 829, and 851. The figure shows that 829 is more likely 
than the other two, which have about the same frequency, with 85, being slightly more 
likely than 849. 
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Fig. 10.12: On the left, the relative percentages of 8-crossing composites versus the confinement 
radius. On the right, the relative percentages of 8-crossing non-alternating prime knots versus the 
confinement radius. 


10.6.5 9-crossing knots 


In Figure 10.13, we show the 9-crossing knot spectrum in the same way as we showed 
the knot spectrum for the 8-crossing knots. Since there are many more knot types (49 
prime and 6 composite) the knot types are not labeled on the horizontal axis. However 
we indicate the extent of the three knot groups: prime alternating knot types, prime 
non-alternating knot types, and composite knot types. Our sample contains 26, 742 
knots with nine crossings, with a maximum of 5, 665 random knots at radius R = 1 
and a minimum of 18 knots at radius R = 4.5. The total of 26, 742 random knots is 
about 63 % of the number of 8-crossing random knots while the number of knot types 
is twice as large (24 versus 55 knot types). Thus as expected, the fluctuations due to the 
smaller random sample muddle the picture. Nevertheless, the curves (maybe with the 
exception of the curve for the largest radii) are still roughly parallel. Here, by roughly 
parallel, we mean that the six curves could be put into a relatively narrow band — 
it does not mean that the curves cannot intersect each other. The reader should also 
keep in mind that the smaller the radius the more actual knots are represented by such 
a curve. Thus the curve representing the largest radii has the largest statistical error, 
which is reflected by the fact that it often crosses through all the other curves. The 
smallest numbers of knots occur for 949 (6 knots), 941 (39 knots), 935 (46 knots) and 
93,4 (60 knots). We also note that the standard diagrams of the knots 949 and 934 are 
based on the Conway basic polyhedra 9* and 8" respectively, see also [12]. The largest 
numbers of knots occur for 9, (2, 604 knots), 94, (2, 585 knots), 945 (2, 028 knots), 
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and 9,3 (1, 799 knots). The next knot in the frequency list drops by more than 600. As 
in Figure 10.11, there is a large increase in the number of non-alternating knots. The 
same observations we made for 8-crossing knot types about the order of the different 
curves according to size of the confinement radius could be true. However, the picture 
is less clear, which could be due to the larger sample variance caused by the larger 
number of knot types and the smaller overall knot sample. 


9 crossings 


Fig. 10.13: The relative percentages of the different 9-crossing knots grouped into six groups by the 
different radii. In each group, the smallest radius is shown as a plot label. 


In order to get more insight, we investigate some subsets of the knot types: com- 
posite knot types and non-alternating knot types. Figure 10.14 on the left shows that 
31#6> is more likely than 3, #6,, which in turn is more likely than 3, #63. This is as ex- 
pected since 6) is more likely than 6,, which in turn is more likely than 63. We also see 
that 4,#5> is more likely 4,#5,, which is as expected since 5 is more likely than 5,. 
The curve for 3,;#31#3, has statistical error bars which are similar to the other curves. 
The frequency of 3,#3,#3, is about the same as that of 4,#5,. This shows that this 
distribution is largely independent of the confinement radius. Figure 10.14 on the right 
shows the relative frequencies of the eight non-alternating knots 94) to 949. It shows 
that 947, 94g, and 949 are much less likely than any of the others. We also note that a 
standard diagram of the knots 9,7 is based on the Conway basic polyhedra 8”. 
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Fig. 10.14: On the left, the relative percentages of 9-crossing composites versus the confinement 
radius. On the right, the relative percentages of 9-crossing non-alternating prime knots versus the 
confinement radius. 


10.6.6 10-crossing knots 


In Figure 10.15 we show the relative percentages of the 10-crossing spectrum. Since 
there are many knot types (165 prime and 14 composite), the knots are not labeled on 
the x-axis. Our sample contains 24, 277 knots with 10 crossings, with a maximum of 
6,021 knots at radius R = 1 and a minimum of 9 knots at radius R = 4.5. There are 
about as many 10-crossing random knots as 9-crossing random knots but the number 
of knot types is more than three times as large. Thus, we expect the fluctuations due 
to the smaller random sample to be even larger than it was for 9-crossing knots. There 
are very few knots for the larger radii, thus it makes very little sense to use the data 
for the large radii. As a result, in Figure 10.15 we only show three curves, the first for 
radii R = 1.0- 1.3 contains 18, 118 random knots, the second for radii R = 1.4 - 1.7 
contains 4, 647 random knots, and the third for radii R = 1.9 - 2.1 contains 1, 004 
random knots. (The fourth would only contain 310 knots and therefore does not yield 
a useful distribution.) 

If we look at the five knots that have the lowest frequency, then four of these have 
a standard diagram based on Conway basic polyhedra: 10373 (2 knots and 10” polyhe- 
dra), 10115 (3 knots and 8" polyhedra), 1012; (4 knots and 9" polyhedra), and 10122 (4 
knots and 9* polyhedra). In fifth place is 1099 (8 knots) which is not based on Conway 
basic polyhedra. As in the previous figures, we can see the large increase in the num- 
ber of non-alternating knots. The non-alternating knots with the largest frequencies 
are as follows: 10132 (1007 knots), 10133 (762 knots), 10124 (596 knots), 1012 (596 
knots), and 10437 (585 knots). 

It is impossible to see in Figure 10.15 if the curves are still roughly parallel. To get 
a feeling of how similar the curves are, in Figure 10.16 we show a zoomed in version 
of the first two curves in Figure 10.15 using only the alternating knots of 10 crossings. 
Here we can see that there is still a lot of similarity between the two curves. 

The error bars for most of the individual knot types are too large to allow definite 
conclusions. We illustrate this by just looking at the 10-crossing composites in Figure 
10.17. The figure on the right shows the 10-crossing composite knots which includea 3; 
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Fig. 10.15: The relative percentages of the different 10-crossing knots grouped into three groups by 
the different radii. In each group, the smallest radius is shown as a plot label. 
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Fig. 10.16: A zoomed in version of Figure 10.15 showing only the relative percentages of the different 
alternating 10-crossing knots using the first two groups of the different radii. 
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component. Their relative percentages seem to be determined by the relative percent- 
ages of the associated 7-crossing knots, with 7, and 7, the most likely, causing 3, #7, 
and 3,#7. to be the most likely composite knots. Similarly, 7, and 7, are the least 
likely, causing 3;#7, and 3,#7, to be the least likely composite knots in the graph 
(see Figure 10.9 and Table 10.2). On the right part in Figure 10.17, a similar observation 
holds for 10-crossing composite knots which contain a 4, component. 
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Fig. 10.17: On the left, the relative percentages of 10-crossing composites with 3:1, and, on the right, 
the relative percentages of 10-crossing composites with 4; versus the confinement radius. 


10.7 The influence of polygon length on the 
distributions of knot types in the presence of 
confinement 


In this section, we discuss the dependence of the distribution of knot types on length 
in the presence of confinement. We want to remind the reader that we consider only 
the new data, and all polygons are in a sphere with confinement radius R = 3. This 
investigation is similar to that of Section 10.6. Intuitively, we expect that keeping the 
confinement radius fixed and increasing the length of the polygons has an analogous 
effect to keeping the length of polygons fixed and decreasing the confinement radius. 
Thus many of the results are similar to those of the last section. This includes the ob- 
servations that the relative distribution of knots ina given group of knots is remarkably 
steady for many different confinement radii. 


10.7.1 3-, 4-, and 5-crossing knots 


Our data contains 184,517 knots with three, four, or five crossings. There are 
117, 059 3, knots, 31, 827 4; knots, 13, 075 5, knots, and 22, 556 5> knots. There is 
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a maximum of 32, 711 knots at length L = 70 and a minimum of 666 knots at length 
L = 10. We note that for all lengths the order (from most frequent to least frequent) of 
knot types is consistent, as shown in Figure 10.5 on the left. 


10.7.2 6-crossing knots 


Our data contains 33, 161 6-crossing knots (all in a confinement sphere of radius R = 
3). There are 7, 351 6, knots, 8, 311 6) knots, 5,107 63 knots, and 12, 392 3,#3, 
composite knots. There is a maximum of 7983 knots at length L = 90 anda minimum 
of 2 knots at length L = 10. The four functions P,(L, 3) are shown in Figure 10.18 on 
the left. The figure shows that the order of the four knot types remains unchanged, 
with 3, #3, having the highest frequency, followed by 62, 61, and 63 with the lowest 
frequency. Figure 10.18 on the right shows the relative frequency of the four different 
knot types. Here for each fixed value L, the y-values of the four data points add up to 
one. 


The most noticeable difference to the case in the previous section is the relative 
stability of the knot distribution, that is for length L = 30 the four functions in Figure 
10.18 on the right look basically constant. The previous section shows that stronger 
confinement causes a relative decline in the number of composite knots leading to 
the expectation of a decline of composite knots for increased polygon lengths (and 
fixed confinement radius). Figure 10.18 indicates that this is not the case. It has already 
been observed in [12, 16] that for composite knots, the analogy between decreasing 
the radius of the confinement sphere for polygons and increasing the length of the 
polygons in a fixed confinement sphere breaks down. We also observe that the relative 
order of the prime knot types is the same in both cases (see Figure 10.8 for the smaller 
confinement radii) and that the achiral knot 63 occurs with the smallest frequency. 
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Fig. 10.18: On the left: The relative percentages of the four different 6-crossing knots versus length. 
On the right: the relative percentages of the four knot types versus length. 
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10.7.3 7-crossing knots 


Figure 10.19 shows the dependence of the distribution of 7-crossing knot types in the 
presence of a confinement sphere of radius R = 3. Our sample contains 19, 971 7- 
crossing knots. There are no 7-crossing knots of length 10 in our sample and the data 
starts with 15 knots of length 20, rising to 5, 587 knots of length 90. The least likely 
knot is 74, which occurs 1063 times in the sample, while the composite knot 3, #4 
occurs with the largest frequency of 5, 769. It appears that the relative distribution 
of 7-crossing knots varies only slightly with the length of the polygons for L = 40. 
From the data for 6-crossing knots (see Figure 10.18), one can speculate that the rela- 
tive distribution of 7-crossing knots is also independent of the lengths of the polygons. 
There are significant similarities between this relative percentage distribution and the 
data for the dependence of the relative percentage of 7-crossing prime knots on the 
radius of confinement as shown in Figure 10.9: The curves show that 7¢ is the most 
frequent, followed by 7;. Next are 72 and 73 which are again approximately equally 
likely. This is followed by 77 and at the end we have 7, and 7, (again approximately 
equally likely). Just as in the case of the 6-crossing knots, the difference between the 
two distributions shown in Figures 10.19 and 10.9 is the relative percentage of the com- 
posite knot 3,#4,, which for small radii has a frequency similar to 7; (or 7¢), but for 
radius R = 3 is the most likely. 


7 crossing relative precentage; R = 3.0 


relative % 
0.30 - 


Fig. 10.19: The relative percentages of the different 7-crossing knot types versus length. 


10.7.4 8-crossing knots 


Next we look at 8-crossing knot types in the presence of a confinement sphere of radius 
R = 3. Our sample contains 20, 921 8-crossing knots. There is a single knot of length 
10, 12 knots of length 20, rising to 6, 475 knots of length 90. The least likely knot is 
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81g which occurs 19 times in the sample, while the composite knot 3,#5 occurs with 
the largest frequency of 3, 486. The prime knot with the largest frequency is the non- 
alternating knot 859, which occurs 2, 521 times. 

It again appears that the relative distribution of 8-crossing knots varies little with 
the length of the polygons once the polygons are long enough. Data supporting this 
claim is presented in several graphs. For Figures 10.20 and 10.21, all polygons up to 
length 50 are combined into one group since, for the smaller lengths, there are not 
many 8-crossing knots. As before we show relative percentages, that is for each curve 
(representing different lengths) the sum of the percentages over all knot types adds up 
to one. Similar to the data for the dependence of the relative percentages of 8-crossing 
knots on the radius of the confinement sphere, there are many more non-alternating 
prime knots than alternating prime knots, although there are also many more compos- 
ite knots. Just as in the case of 7-crossing knots, the composite knots behave differently 
with increasing polygon length than they do with decreasing confinement radius. We 
also note that for the non-alternating prime knots, the curves are strictly ordered by 
length, with the shortest length leading to the largest relative frequency (on top) and 
the longest length leading to the lowest relative frequencies (on the bottom). The be- 
havior for the alternating prime knots and composite knots is inconsistent but the 
order of curves is mostly reversed. Thus with the exception of the composite knots, 
this behavior is similar to the behavior of the 8-crossing knot types with a changing 
confinement radius. 
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Fig. 10.20: The relative percentage of the 8-crossing knots from 8, to 833 grouped into six groups by 
the different lengths. 


Figure 10.22 shows the relative percentages of non-alternating knots (right) and 
the composite knots (left). It is remarkable how close the relative percentages for 
composite knots of length 90 (R = 3) ({0.32, 0.58, 0.1} rounded to the nearest per- 
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Fig. 10.21: The relative percentages of the 8-crossing knots from 813 to 4:#41 grouped into six 
groups by the different lengths. 


cent) are to the relative percentages for composite knots at radius R = 1 (LZ = 30) 
{0.34, 0.56, 0.1}. For the non-alternating knot types, the data for polygons of length 
90 is (R = 3) {0.21, 0.49, 0.30} compared to (L = 30) {0.24, 0.47, 0.29} for length 
30 polygons in confinement of radius R = 1. 
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Fig. 10.22: The relative percentages of the different 8-crossing knot types on the vertical axis and 
its dependence on the polygon length. On the left: composite knots; on the right: non-alternating 
knots. The frequencies are normalized so that adding all values for a fixed length yields one. 


For alternating prime knots of eight crossings, the picture is not clear since the 
error bars are too large to draw reliable conclusions about the frequency of all knot 
types. This is shown in Figure 10.23. Clearly, the knots that are the most frequent, 83 
and 8,,, are also on the top of the knot type distribution for a fixed length and varying 
confinement radius, see Figures 10.10 and 10.11. 
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Fig. 10.23: The relative percentages of 8-crossing alternating prime knot types versus length. 


10.7.5 9-crossing knots 


Our sample contains 14, 406 9-crossing knots. There are no knots of length 10, 4 knots 
of length 20, rising to 4, 913 knots of length 90. The least likely knot is 949 which oc- 
curs 6 times in the sample, while the composite knots 3,#6, and 3, #62 occur with the 
largest frequency of 1,007 and 1,115 respectively. The prime knot with the largest 
frequency is the non-alternating knot type 944, which occurs 971 times. The most fre- 
quent alternating prime knot is 9g, which occurs 307 times. There are 5, 925 alternat- 
ing prime knots (9, to 9,1), 3, 836 non-alternating prime knots (94 to 949), and 4, 645 
composite knots. This shows again that the frequency of non-alternating knot types 
and composite knot types is larger than the frequency of alternating prime knot types. 
Similar to the last two subsections, the data suggests that the relative distribution of 
9-crossing knots varies little with changes of the polygon length once the polygons are 
long enough (L = 50). Data to support this claim is presented in two graphs. Figure 
10.24 shows the relative percentage of non-alternating knots (right) and of composite 
knots (left). It is remarkable how close the percentages for composite knots of length 
90 (and radius R = 3.0), rounded to the nearest percent, are to the percentages for 
composite knots at radius R = 1 (and length L = 30) (see Table 10.3). Just like in the 
section about the dependence of the knot distribution on the confinement radius, the 
order of the relative frequencies of 9-crossing composite knots is strongly related to 
the relative frequencies of the 6- and 5-crossing prime knots. Similar results are ob- 
tained for the non-alternating prime knot types of length 90 in confinement of radius 
R = 3 compared with length 30 polygons in confinement of radius R = 1 (see Table 
10.3). 
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Tab. 10.3: Similar relative percentages for polygons of length 90 (and radius R = 3.0) and polygons 
in confinement radius R = 1.0 (and length L = 30). On the left for composite knots and on the right 
for non-alternating knots. 


knot length90 R=1 | knot length90 R=1 

R=3 L=30 R=3 L=30 

31#31#31 0.17 0.07 942 0.24 0.22 
31#61 0.22 0.22 943 0.18 0.16 
31#62 0.25 0.27 944 0.24 0.25 
31#63 0.14 0.20 945 0.18 0.21 
4i#5 0.08 0.08 946 0.10 0.08 
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Fig. 10.24: On the left the relative percentages of different 9-crossing composite knot types versus 
length. On the right, the relative percentages of different 9-crossing non-alternating prime knot 
types versus length. 


10.7.6 10-crossing knots 


Our sample of knots with 10 crossings contains 12, 326 knots. There are no knots of 
length 10, one knot of length 20, rising to 4, 739 knots of length 90. The least likely 
knot is 10,33, which is the only knot that does not occur in the sample. A minimal 
standard diagram of 10123 is based on the 10* basic Conway polyhedron. We note 
that this is the only knot type with less or equal to 10 crossings that was not observed 
in the old data set either [12]. This confirms the observation that knots whose minimal 
standard diagrams are based on basic Conway polyhedrons occur with a probability 
that is much lower than the probability of other knot types [12]. The composite knot 
3,#3,#4, occurs with the largest frequency of 457. The prime knot with the largest fre- 
quency is the non-alternating knot 10132, which occurs 379 times. The most frequent 
alternating prime knot is 1029, which occurs 81 times. There are 3, 751 alternating 
prime knots (10; to 10123), 5, 520 non-alternating prime knots (10124 to 10165), and 
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3, 055 composite knots. This shows, just as for other crossing numbers, that the fre- 
quency of non-alternating knot types and composite knot types is larger than the fre- 
quency of alternating prime knot types. The relative distributions of 10-crossing knots 
in our data seems to vary more than for other graphs with the length of the polygons 
(once the polygons are long enough), however this is due to the large error bars. Thus, 
the authors claim that for a large enough sample, the statements of the previous sec- 
tions would still hold: The relative distributions vary only slightly with changes in the 
length of the polygons. To support this, we show the non-alternating knots and the 
composite knots. The reader should compare Figure 10.25 with Figure 10.17 to follow 
the discussion in this paragraph. In the figure on the left are the composite knots with 
a 3, component. Their frequency seems to be determined by the frequency of the 7- 
crossing knots, with 75 and 7¢ the most likely knots, causing 3,#75 and 3,#7¢ to be 
the most likely composite knots. Similarly 7, and 7, are the least likely knots, caus- 
ing 3,#7, and 3,#7, to be the least likely composite knots, see Figure 10.9. On the 
right are the results for composite knots with a 4, component. Here we would expect 
that 4,#6,; occurs with the highest frequency — but this does not always happen. How- 
ever, as the error bar on the knot 4, #6) indicates, the curves do not allow any definite 
conclusions. 
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relative % 
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Fig. 10.25: On the left, the relative percentages of 10-crossing composites with a 31-component ver- 
sus length. On the right, the relative percentages of 10-crossing composites with a 4;-component 
versus length. 


10.8 Conclusions 


We finish this chapter with several conjectures. Here R is the radius of confinement 
and n is the length of a polygon P in our sample. We also want to remind the reader 
that the evidence collected in this article is restricted to 1 < R< 9/2 and10<L< 90. 
Furthermore our sample does not contain enough knots for large crossing numbers 
Cr > 10 to draw conclusions about the frequencies of individual knot types. 
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e The distributions of different prime knot types are virtually independent of the 
confinement radius and length. To be more precise, we claim that if K, and K, are 
two prime knot types in a knot space Sc, (that is both knots have the same cross- 
ing number Cr) and if Px,(Lo, Ro) << Px,(Lo, Ro) for some confinement radius Ro 
and some length Lo (where Lo is big enough so that both knots can be formed) then 
Px, (L, R) << P,(L, R) for all confinement radii R = 1 and all lengths L > Lo. Simi- 
larly if Px, (Lo, Ro) = Px, (Lo, Ro) for some confinement radius Ro and some length Lo 
(where Lo is big enough so that both knots can be formed) then Px, (L, R) = P3,(L, R) 
for all confinement radii R = 1 and all lengths L = Lo. 


¢ The above property also holds if both knot types in Sc, are composite knots. 


e The relative probabilities of a composite knot type versus the probabilities of 
a prime knot type are virtually independent of the length but not independent of 
the confinement radius. To be more precise, we claim that if K, is a prime knot and 
Kz a composite knot in a knot space Sc, (that is both knots have the same cross- 
ing number Cr) and if Px,(Lo, Ro) << Px,(Lo, Ro) for some confinement radius Ro 
and some length Lo (where Lo is big enough so that both knots can be formed) then 
Px, (L, Ro) << Px,(L, Ro) for alllengths L = Lo. Similarly if Px.,(Lo, Ro) = Px,(Lo, Ro) 
for some confinement radius Ro and some length Lo (where Lo is big enough so that 
both knots can be formed) then Px, (L, Ro) = Px,(L, Ro) for all lengths L = Lo. 


¢ Fora fixed crossing number Cr the probability that a polygon represents a knot in 
Sc, eventually decreases in strong confinement. That is to say, provided that the length 
L is long enough so that K can easily be formed, the function Px (L, R) is initially an 
increasing function in R considering R = 1/2 as a starting point. For composite knots 
K this increase is more pronounced than for prime knots K. 
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Kyle Chapman, Kenneth C. Millett, and Laura Plunkett 
Sampling Spaces of Thick Polygons 


Abstract: Open and closed polygons provide an attractive coarse grained model for 
many molecular structures. The random selection of polygons of a specified thick- 
ness is, however, an objective that has not been achievable until very recently. Here, 
we give an elementary description of the distinct sampling strategies that have been 
employed, their limitations, and the new algorithms that now allow one to randomly 
sample the spaces of open and closed polygons with specified thickness. We observe 
that the introduction of even a very modest thickness has an immediate and profound 
effect on the shape, the size, and the type of knot formed. 


11.1 Introduction 


A polygon in three dimensional Euclidean space consists of a finite sequence of 
straight line segments, called edges, meeting only at the endpoints of adjacent edges, 
called vertices. Polygons are closed if they form a loop and are, otherwise, open. Anal- 
ogous to the ball and rod molecular models, polygons have provided scientifically use- 
ful coarse grained models of locally linear molecular structures. These polygons have 
provided productive coarse grained models of a wide range of molecules: rubber, am- 
ber, polyethylene, or other polymer melts or individual DNA chains, or proteins. In 
some cases, the edge lengths are allowed to vary in length according to a given dis- 
tribution and, in others, they are required to have a constant length, in some cases 
reflecting the monomers and bonds of the structure while, in others, reflecting the 
homogeneity of the persistence (or Kuhn) length of the modeled macromolecule. Sim- 
ilarly, the bending angles between adjacent edges reflect the curvature character of 
the molecule and a triple of adjacent edges can be used to measure the intrinsic tor- 
sion of the chain. Another facet of the modeling challenge is to capture the thickness 
of the macromolecule or, similarly, the excluded volume of the modeled chains. The 
notion of excluded volume, introduced by Kuhn [10], was employed by Paul Flory [6] 
to explain that certain fundamental aspects of polymer physics are a consequence of 
the fact that a portion of a long polymer can not occupy the same space as another 
portion of the same polymer. As a result, for example, circular polymer chains can not 
change their topological knot type, i.e. “crossings are forbidden,” and, furthermore, 
their properties are distinct from those of infintesimal mathematical polygonal rings. 
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The properties of excluded volume and thickness have proved to bea particularly chal- 
lenging facet of the structure to simulate, even in a coarse grained manner, across a 
meaningful physical range of values. 

Our focus will be on the available methods to sample the sets of open and closed 
polygons with which one may try to access structural information related to the thick- 
ness of the configuration. After an overview of how one may define the thickness of 
a polygon, we will describe these ways in sufficient detail that one may develop an 
algorithm to implement these methods. The first topic will be the ways in which one 
samples open and closed polygons without thickness constraints. These are subse- 
quently sorted by thickness with the hope of generating a statistically robust sample 
for each range of thickness. Unfortunately, the rapid decay of sample size with increas- 
ing thickness limits the effectiveness of this method. We will next discuss two new 
methods that allow one to randomly sample collections of open and closed polygons 
having any desired thickness and, in conclusion, briefly describe some observations 
resulting from these new methods. 


11.2 Classical Perspectives 


The concept of excluded volume was introduced by Kuhn [10] and applied by Flory in 
the study of polymers [6, 21]. For example, the Kuhn length of polyethylene (PE) is 14 
angstroms. Of coarse, the molecule has a cross-sectional molecular diameter as well 
as steric forces that give it an effective thickness that must be taken into consideration 
in a model. As noted, this has proved to be quite challenging due to the computational 
cost of amolecular model and the lack of a cruder, coarse grained, simulation method. 
If the effects of the excluded volume are neutralized in a given situation, described as 
being at the theta point, one ignores these interactions and employs the ideal chain 
model. These are defined as freely joined polygonal chains in which one ignores in- 
teractions between edges. We will now describe how one measures the thickness of 
polygonal chains, how such chains can be easily generated, and the character of the 
resulting thickness distribution. 


11.2.1 Thickness of polygons 


We employ the ideas of Rawdon [12, 20], to define the thickness of a polygon in 3- 
space (which Rawdon calls its injectivity radius). The underlying concept is that of a 
tubular neighborhood of a smooth curve consisting of the embedded family of discs of 
radius r at each point of the curve. The supremum of the radii of the embedded tubular 
neighborhoods is called the thickness of the curve. Rawdon extended this concept 
to polygons by taking the minimum, over all vertex angles, of a curvature constraint, 
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Fig. 11.1: Tubular neighborhoods of the unit equilateral triangle including one of maximal radius v3 


wr , and half the minimum distance between distant edges, taken over all pairs of 


distant edges, the distal distance constraint. Tubular neighborhoods of an equilateral 
triangle are illustrated in Figure 11.1 and, of random walks, in Figure 11.2. 


11.2.2 Self-avoiding random walks 


An embedded open chain with n unit edge lengths and random vertex angles is a self- 
avoiding random walk, shown in Figure 11.2, is given by a sequence of vertices, v; with 
i = 0,1,...n or, equivalently, a sequence of edge vectors, e; with j = 1,...n. These 
are related by vo = {0, 0,0} and v; = a e;. To randomly generate an n step ran- 
dom walk (or n edge random open polygon) it is sufficient to individually generate 
the corresponding n edge vectors as follows: select 6 = 2 1 randomreall[0, 1] and @ = 
ArcCos[2 randomreal[0, 1]-1] to define e; = (Sin(@) Sin(@), Cos(@) Sin(@), Cos(@)). 


11.2.3 Closed polygons: fold algorithm 


An embedded closed chain with n unit edge lengths and random vertex angles is a 
self-avoiding polygon constrained by the requirement that the initial and terminal ver- 
tex are equal, i.e. the closure constraint. The satisfaction of the closure constraint is 
the critical feature of sampling algorithms. The fold method preserves the closure by 
randomly selecting a pair of non-adjacent vertices to define an axis dividing the poly- 
gon into two arcs. One of these arcs is rotated through a random angle and adjoined to 
the complementary arc to define, following confirmation that resulting configuration 
is embedded by directly checking for possible edge intersections between edge seg- 
ments, a new polygon, see Figure 11.3. We also show the result of 100 random folds 
applied to the 10 edge regular polygon. While its randomness is an open question, the 
fold algorithm is known to be a transitive algorithm, i.e. one capable of taking any con- 
figuration to any other configuration, whose resulting statistics, for example those of 
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Fig. 11.2: A 20 step random walk: curvature constraint is 0.041 and the distal distance constraint 
is 0.071. The 100 step random walk has angle constraint of 0.054 and distal distance constraint 
0.026. 


observed knot types or average radius of gyration, are consistent with being random 
[16, 17]. 


11.2.4 Closed polygons: crankshaft algorithm 
The crankshaft method is a computationally attractive algorithm that has also been 


shown to be transitive and having statistics consistent with randomness [1]. This al- 
gorithm randomly selects a pair, larger subset if desired, of edge vectors of an equilat- 


Fig. 11.3: A regular 10 edge polygon, a first fold, and one created from the regular polygon using 100 
random folds along axes such as the one indicated. 
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Fig. 11.4: A regular 10 edge polygon, a first crankshaft, and one created from the regular polygon 
using 100 random crankshaft modifications. 


eral polygon and defines a segment based at the origin using this ordered sequence 
of vectors. Following a random rotation about the axis defined by the origin and the 
endpoint of the segment, one replaces the originally selected edge vector by the cor- 
responding one in the resulting segment. Since the sum of these edge vectors has not 
changed, the newly defined polygon is closed. If it is embedded, the result is the new 
polygon, see Figure 11.4. The fact that one only has to modify a small collection of edge 
vectors, in our case just two, makes the calculations fast as they are explicit as shown 
in the following formulae. 


ida | + cos(@) i — a + sin(@) 1 i (11.2.1) 


e; + crank(e;, e;, 0) = 
ae 2 lier * ejl 


' + cos(o) 2! is a + sin(6) i na 


ee _ (11.2.2) 
2 2 |e; + e:l| 


ep crank(e;, e;, 9) = 


11.2.5 Quaternionic Perspective 


Building upon a construction by Hausmann and Knutson [8], it is possible to describe 
a natural measure on open and closed polygons in 3-space [2, 3]. This is based on a 
map from the complex Stiefel manifold of 2-frames in n-space to the space of closed 
n edge polygons in 3-space of total length 2 whose construction employs the Hopf 
map on quaternions. In the setting of equilateral n edge polygons, this approach al- 
lows one to randomly sample these polygons by jointly sampling the polyhedron de- 
termined by a complete set of secants and n — 3 dihedral space angles between the 
triangles formed by the secants and the edges of the polygon, each with the uniform 
measure. Data generated from this approach is consistent with the data from the ran- 
dom fold and crankshaft methods and, therefore, provides evidence that both the fold 
and crankshaft algorithms sample the space of n edge equilateral polygons randomly. 
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Fig. 11.5: A thickness distribution of a random sample of 250000 twenty edge equilateral polygons. 


11.3 Sampling Thick Polygons 


A consequence of the desire to understand the geometric and topological conse- 
quences of excluded volume on the average character on n edged open and closed 
polygons, i.e. thick coarse grained models of polymers, one is led to use the methods 
we’ve discussed to acquire data that gives insight into the world of thick polygons. Un- 
fortunately, the proportion of a random sample decays so rapidly that this approach 
means that generating sufficient data required to give robust estimates as thickness in- 
creases, see Figure 11.5, is unreasonable. This is especially true if one wishes to study 
the average properties of, for example, trefoil knots. In this section, we describe two 
new sampling algorithms that permit one to randomly sample the sets of open and 
closed polygons of a specified thickness. They were first presented in the the PhD dis- 
sertations of Laura Plunkett, for open polygons, and Kyle Chapman, for closed poly- 
gons [4, 19] at University of California, Santa Barbara. They are described below. 


11.3.1 Primer on Probability Theory 


Much of the value of the following algorithms comes from being able to use them to 
prove a Markov chain arising from the moves involved can be used to determine a 
probability measure on various sets. In order to motivate these algorithms, we start 
with some of the key definitions involved. 

We consider a space of states, X. We have a space of moves M, which gives a map 
F:XxM — X. If we choose a starting point xo, then a random sequence in M gives 
a random sequence in X. This is a Markov chain which gives a sequence of probabil- 
ity distributions P"(xo, A) on our space X defined by the probability of starting at xo 
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and landing in the Borel set A after n moves coming from M. The goal is to show that 
this sequence of probability distributions P"(xo, A) converges to a limiting probability 
distribution P(A). 

The first step is knowing that there isn’t some lower dimensional subset we are 
restricted to. This is given by the notion of Forward Accessible, which means that 
independent of the choice of xo the orbit under M has non-empty interior. While any 
single sequence only reaches a countable number of points, the set of all possible 
sequences can reach a full dimensional set. 

The next step is knowing that the topology of the space is sufficiently well re- 
spected. We have the sequence of probability distributions P", so given any distri- 
bution on the natural numbers a, we get a new probability distribution Ka(x, A) = 
Soy (a(n) - P"(x, A)). Our Markov chain is a T-chain if there is some measure T on 
X with T(x, A) < Ka(x, A) for all A, T(x, X) > 0 for all x and T(x, A) lower semi- 
continuous in x for all A. This is a complex and difficult to show directly, but having 
other properties, we use Lemma 11.3.1. (See below.) A Markov chain which is forward 
accessible and with M having a lower semi-continuous probability density function 
is a T-chain. In our cases, moves from M are being chosen uniformly, so it has the 
required PDF, and so we have a T-chain. 

Having a reachable state xo for our space, and we having a T-chain allows us to 
assert that there is an invariant measure on X, but it may not be a probability measure. 
For example, if we consider a random walk on the line or in the plane, the sequence P” 
spreads steadily over the whole region, and the invariant measure is Lebesgue mea- 
sure. The property which shows this issue does not happen and that the invariant mea- 
sure P is a probability measure is bounded in probability on average. This says that 
for every € there is a compact set C with P? = +77, P' satisfying lim inf P"(C) > 1-«. 
Essentially this amounts to showing that the probability of going to “oo” is zero. We 
will be dealing with state spaces X which are compact, so this issue is automatically 
resolved, and so the invariant measure P is a probability distribution. 

Finally, these can be combined to allow the use of the aperiodic ergodicity theo- 
rem to get the desired result. Letting B(X) denote the Borel subsets of X, 


SUP acp(x)|P"(x, A) - P(A)| > 0. 


11.3.2 Open polygons: Plunkett algorithm [18] 


We would like to generate random elements from the set of walks with n edges and 
thickness t. We will call the set of these walks W(n, t). To sample this space, we will 
start with a walk in the space, W € W(n, t), and modify it, while ensuring we always 
have a thickness of at least t, until the resulting walk, W’, is independent of the origi- 
nal. 
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Our method is inspired by the pivot method, a Monte Carlo method proposed 
by Lal, and later implemented and proven to be ergodic by Madras and Sokal. The 
pivot method generates self-avoiding walks on the lattice by applying successive lat- 
tice symmetries at randomly selected vertices along the walk and accepting the new 
walk if it was not self-intersecting [13, 11]. Analogously, we perform reflections at ver- 
tices to portions of the walk. 


— Asingle reflection will consist of selecting a random vertex and a random plane 
through that vertex. One portion of the walk will be reflected through the plane, 
and the other will remain fixed. If the new walk is still in W(n, t), we accept the 
move and continue. If it is not, we return to the walk before our reflection and try 
again with a new vertex and a new plane. 

— A double reflection will consist of selecting two random vertices and a random 
plane through each vertex. The reflections are performed successively, and if the 
end result is a walk in W(n, t), we accept the change and continue. If it is not, we 
return to the walk before our reflections and try again. 

— The acceptance probability of a reflection move varies with the thickness and 
length of walk. For example, for random walks of length 1000 and thickness of 
1.00, the acceptance rate is 25.59% [18]. 


After a sufficient number of single and double reflections, we have a new walk, Ww, 
independent of the starting walk W. What remains is to show that this method samples 
the space W(n, t) ergodically [14]. Informally, this means that the Monte Carlo Markov 
Chain gives a sampling method that converges to the probability measure on, in this 
case, acontinuous state space. The first requirement in showing that our method is er- 
godic is to showit is transitive. A transitive Monte Carlo method is capable of sampling 
(or visiting) every state in the state space. For our method, proving transitivity would 
require showing we can go from any walk, Wo, to any other, W;, through a finite se- 
ries of single and double reflections with each intermediate walk in W(n, t). Because 
these reflections are reversible, this is equivalent to showing that we can get from any 
configuration, Wo, to the straight configuration, Ws, a straight rod of length n. 

So suppose we start with a walk Wo € W<(n, f). We find a path to the straight 
configuration, Ws, by the following steps: 


— we will perform single reflections on the outside of the convex hull of our walk 
until the ends of the walk determine the diameter of the convex hull, then 

— wewilluse double reflections to remove critical points (maxima and minima) from 
the walk, and lastly 

— we will perform more single reflections to straighten the configuration until we 
arrive at Ws. 


246 — Kyle Chapman, Kenneth C. Millett, and Laura Plunkett 


If we have a walk, W;, then we may perform any reflection through a vertex on 
the surface of the convex hull by a plane incident with only the surface of the convex 
hull. No reflection of this type will decrease the thickness of the resulting walk, as all 
pairs of edges either remain in the same position relative to each other, or the distance 
between them is sufficiently increased so as to never be less than 2t. We will begin by 
reflecting in this way until both ends of the walk are on the surface of the convex hull 
and, further, that the two ends determine the diameter of the convex hull, as in Figure 
11.6. 


Initial Configuration 


Intermediate Configuration 


One Endpoint on Convex Hull 


Fig. 11.6: Here we have three successive reflections through planes incident with points v; on the 
convex hull. The end result is that the two ends of the chain have been moved to the surface of the 
convex hull, and determine the diameter of the convex hull. While the represented case is two di- 
mensional, the application is in three dimensions. 


Now we will remove pairs of maxima and minima (relative to the diameter of the 
convex hull, d’, from the last step) through a series of double reflections. If our walk W; 
has no critical points, we may move on to the last step, the straightening. If there are 
critical points, each critical point (with respect to this diameter of the convex hull), 
is either at a vertex v; or an edge s;. There is a plane through the critical point and 
perpendicular to the diameter, which we will call P;. 
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We will select our maximum and minimum very carefully. First, We select the 
highest local maximum, that is, the local maximum m; such that the distance between 
its plane P; and the plane P, containing the terminal and highest point is minimized, 
as in Figure 11.7. Between this maximum m; and the end of the walk vn there is at least 
one local minimum. We will select the local minimum nj, i < j < n, such that the dis- 
tance between the plane containing n,, P;, and the plane containing vj, P;, is greatest, 
as in Figure 11.7. 

First, we will reflect the terminal end of the walk from m; to vn through the plane 
P;. Then, we will reflect the terminal end of our modified chain, from hi to Yn through 
the plane P,, the reflection of P;. As in Figure 11.7, we have eliminated a pair of critical 
points. We now have three subsections of the walk to consider: from vo to m;, m; to 
fi; and ni; to in. Between vo to m; and m; to fi;, we have performed a reflection similar 
to part 1, and no thickness reducing interactions have been introduced. Similarly for 
the interaction between m; to fi; and fi; to in. For the interaction between the first 
and third segments, from vo to m; and fj to in, the third segment has been translated 
further away from the first segment. This increased distance is enough to verify that 
the thickness of the resulting walk has not been decreased. 

Lastly, we will straighten this configuration by performing single reflections. As- 
sume we have straightened the segments from v;,; to Vn and we wish to straighten the 
next edge s; between v; and v;,,. As in Figure 11.8, we perform a reflection so that the 
straight segment from v;,; to vn is in line with the edge s;. We proceed until we have 
the straight configuration. 

To prove that this method is ergodic, in addition to being transitive, we need the 
following lemmas about Markov chains. 


Lemma 11.3.1. The Markov Chain F being forward accessible and the state space X 
having a probability density function implies that F is a T-chain. 


These are modifications of the arguments in Meyn and Tweedie [14]: Propositions 7.1.5 
and 6.2.4. 


Lemma 11.3.2. A T-chain with a reachable state is bounded in probability on average 
if and only if it is positive Harris recurrent. 


This is proposition 18.3.2 in Meyn and Tweedie [14]. 
Lemma 11.3.3. F is aperiodic. 


Sketch of Proof Seeking a contradiction, suppose there exist C;, subsets of the state 
space X or in this case W(n, ft), such that i € Zn, the cyclic group of order n, and they 
are a collection of disjoint non-empty closed sets with the probability of going from C; 
to C;,, is one. Let w € C;. In our context, if we apply a pair of reflections at the second 
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Fig. 11.7: Here we have a typical double reflection move for part two of the proof. First, we iden- 
tify the local maximum, m;, and local minimum, m, such that m; is the highest maximum and n; is 
the lowest minimum, relative to the axis selected at the beginning of this step. The first reflection 
through the plane P; reflects the segments between m; and vp. The reflection through P,, the re- 
flected image of P;, reflects the segments between fi; and (,. This gives us our new configuration, 
with two fewer critical points and with the same (or greater) thickness. 
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Initial Configuration After Reflection 


Fig. 11.8: In the previous step, we eliminated pairs of maxima and minima relative to d*, the diam- 
eter of the convex hull in the first step. Now, via a single reflection reflection, we straighten the 
segments Sj1, .... 5; to be in line with s;. 


to last vertex through planes containing the last edge, then w is left fixed by this double 
reflection. Therefore there is a double reflection move r with r(w) = w ¢ C;,,. Since 
Cis1 is closed, there is an open neighborhood of w which is not in C;,, so there is a 
positive probability of landing in this open neighborhood of w. This contradicts the 
requirements of periodicity. Thus, F is aperiodic. 


Theorem 11.3.1. The Markov chain on walks in W(n, t), as described above, is er- 
godic. 


Proof. From our work showing transitivity, we have that for any walk there is a se- 
quence of finitely many single and double reflections which can transform this walk to 
the straight walk, and each single and double reflection in this sequence is on the inte- 
rior of asmooth section of F. Then, by concatenating this sequence with the reverse of 
another such sequence, we can connect any two walks on the interior of W(n, t) with 
a sequence of reflections on the interior of asmooth section of F. Therefore F creates a 
forward accessible Markov chain. Next, since our noise parameter M has a probability 
density function, namely a constant, we can combine this with forward accessibility to 
get that the Markov chain is a T-chain. We get that any sequence of probability distribu- 
tions is tight, so in particular our Markov chain F is bounded in probability on average 
because our state space is compact. Because there is a sequence of reflections ensur- 
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Fig. 11.9: An example of the three aspects being applied to a hexagon 


ing that every walk can be modified to be within any neighborhood of the straight walk 
with positive probability, we conclude that F has a reachable state. Therefore F, being 
a T-chain with a reachable state which is bounded in probability on average, must be 
positive Harris recurrent. Finally, since there is a fixed point at the interior ofa smooth 
section, F cannot be periodic and so the aperiodic ergodicity theorem [14] tells us that 
F is ergodic. 


11.3.3 Closed polygons: Chapman algorithm 


We work in the space R(n, t) of polygonal rings with n unit length edges and a thick 
ness of greater than or equal to t. We show that any knot K € R(n, f) has a sequence of 
polygonal folds which brings it to the regular planar polygon Py, without having the 
thickness be less than t. Note that this algorithm unknots the polygon and, therefore, 
may not preserve the topological knot type of the polygon. The overall structure for 
making these choices has 3 aspects. The first aspect is being able to expand the knot 
until its orthogonal projection to the x- y plane is a convex projection, without chang- 
ing the number of minimal height vertices, with respect to the orthogonal projection. 
The second aspect is being able to take a knot with convex projection and increase the 
number of minimal height vertices, with respect to the orthogonal projection. These 
two actions are combined and alternated so as to allow one to make the knot planar 
and convex. The third aspect is to take a planar and convex polygon and make it reg- 
ular. 

A key piece for the first aspect, making the projection convex, is the fact that a re- 
flection move across a plane which does not intersect the interior of the convex hull, 
does not increase the thickness. This is a special example of the fact that if no two ver- 
tices are made closer together, then the thickness in not increased. For making the pro- 
jection convex, we utilize the argument of the Griinbaum-Zaks theorem [7]. To properly 
state this theorem we use need a weakened notion of convexity called exposed which 
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Fig. 11.10: Examples of the types of convexity. Overlapping edges are separated for clarity. 


is shown in Figure 11.10. While a polygon in the plane is convex if it is an embedding 
onto the boundary of the convex hull, it is exposed if the image is the boundary of 
the convex hull and the pre-image of each vertex is connected. This means that the 
polygon might double back over an edge, but still cycles around the convex hull. The 
Griinbaum-Zaks theorem says that any polygon can be made exposed using a finite 
sequence of reflections across edges of the convex hull. 

The Griinbaum-Zaks theorem allows us to make the projection exposed, so we 
need to make the projection convex. For that, we consider an edge of the convex hull 
which is not hit injectively. Taking the pre-image of that edge we get a strip, with an 
arc of the knot moving from one side of the strip to the other, as in Figure 11.11. Look- 
ing at the convex hull of this arc we can find a sub arc which has end points which 
share a bottom edge of that convex hull. This sub arc can be pushed out slightly using 
a rotation about the line labeled | through a small angle, which can be done with a 
pair of reflections. This makes the projection more injective and so alternating with 
Griinbaum-Zaks theorem completes the first aspect. 

The second aspect is much simpler. Since the knot is not planar, we can find a sub 
arc which is above the minimum height except at the endpoints which are minimal 
height vertices. This arc can be rotated down until another vertex has reached minimal 
height, as in Figure 11.12. 

The final aspect uses a special move which consists of six reflections. Within the 
planar convex polygon, we choose an inscribed quadrilateral, which has two oppo- 
site corners which correspond to vertices at which the interior angles are smaller than 
the regular angle, and the complementary two corners correspond to vertices at which 
the interior angles are larger than regular the regular angle. Focusing on the inscribed 
quadrilateral, two reflections can give a new planar quadrilateral with different an- 
gles. This leaves four flaps in the form of the arcs connecting the vertices of the in- 
scribed quadrilateral, and so an additional four reflections will make the whole knot 
planar. Intermediate value theorem allows this collection of six reflections to be cho- 
sen so one additional angle is regular. 
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Fig. 11.11: An example of the pre-image of an edge of the convex hull. a denotes a vertex which is 
strictly above another point in the arc. a. denote the vertices in either direction which are on the 
bottom of the convex hull of the arc. / is the boundary line of this convex hull shared by az 


Fig. 11.12: An example of a rotation which increases the number of minimum height vertices, shown 
from a projection which is parallel to the axis of rotation. 
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Fig. 11.13: An example of applying the six reflection move to a decagon. The initial diagram has four 
distinguished vertices connected by dotted lines. 


We now let F denote the Markov chain arising from applying sequences of 6 re- 
flection moves at a time to a closed polygon x in the state space X = R(n, t) of n edge 
rings with thickness at least f. 


Lemma 11.3.4. The Markov chain F is ergodic 


The algorithm we described shows that any point x can be brought to the regular pla- 
nar polygon with a finite sequence of moves, so by using this and its reverse, we can 
connect any two points in the state space. This means that we can utilize the same 
argument for R(n, t) as for W(n, t). The algorithm shows forward accessibility and a 
reachable point. This lets us claim that it is a T-chain. Since it is a compact state space 
we get an invariant probability measure. Finally, since there exists moves which take 
a point to itself, the Markov chain is aperiodic, so the aperiodic ergodicity theorem 
applies. 


11.4 Discussion and Conclusions 


Considering open polygons, we generated a data set consisting of 5000 samples for 
each of the following pairs of lengths and thicknesses: 


— lengths, N: 100 to 1000 in steps of 100. 

— thicknesses: 0, 0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9 and 1 (Note, these correspond 
to minimal interior angles of about 0°, 23°, 44°, 62°, 77°, 90°, 100°, 109°, 116°, 
122°, and 127°.) 


Even for the thickest and longest of these samples, the acceptance rate for a pro- 
posed reflection or double reflection was above 25%. From this data we examined 
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squared radius of gyration, squared end to end distance, and for the length 300 sam- 
ples, knotting in the open walks [15]. 
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Fig. 11.14: Knot probability as a function of thickness for self avoiding walks of length 300. For each 
0.1 thickness is increased, the probability of knot formation is cut by 50% or more, up to ¢ = 0.5. For 
t > 0.5, the probability of knot formation is constant. 


As we see in Figure 11.14, the probability of knot formation for walks with length 
300 is quite high without the introduction of thickness, with more than half of samples 
having a dominant, nontrivial knot type, i.e. a specific knot type appearing more than 
half the time. For each 0.1 (to be compared to the length of each segment, 1,) that 
the thickness is increased, the probability of knot formation is cut in half until the 
thickness is at 0.5 after which this probability is constant. We conclude that the intro- 
duction of thickness has an immediate and profound effect on knot formation, and we 
expect that further studies will confirm that this effect persists to longer length scales 
and in both thicker and thinner walks. 

We also analyzed the growth exponent for the thicknesses from 0.0 to 1.0 by find- 
ing the average radius of gyration squared, RG?, for each data set, and performing a 
linear regression with vertical offsets on the log of the data. Assuming that RG; « N”, 
this allowed us to solve for v as a function of thickness. 

For ideal chains it has been shown that RG;, « N, where N is the number of 
segments. It is easy to verify that for t = oo we have the straight configuration and 
RG, x N? [22]. Excluded volume has been classically characterized by the estimate 
vy = 1.2 [5]. In wet experiments and numerical simulation for good solvent polymers, v 
has been estimated between 1.1 and 1.2, with 2(0.588) = 1.18 in walks on the simple, 
cubic lattice [5]. We can see that there is an immediate impact of thickness: the scal- 
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ing exponent increases to the [1.14, 1.17] interval almost immediately, as predicted 
by Vologodskii’s simulations of very thin walks and rings [9]. 

For comparison, we will refer to 1; as the scaling exponent for the self avoiding 
walk with thickness t, and yt as the scaling exponent for the family of walks with only 
a bending constraint for adjacent edges, and no restrictions equivalent to excluded 
volume. 


Estimated Growth Exponent 
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Fig. 11.15: We analyzed how the squared radius of gyration scaled as a function of N. Assuming that 
RGi, o NY”, we solved for v and plotted it as a function of thickness for self avoiding walks (red, 
solid). The exponent, as expected for the ideal case, was close to 1 for thickness 0.0 and increased 
to 1.16 where it was fairly stable (+.01) for thicknesses t € [0.2, 1.0]. 


For those walks without long range or excluded volume interaction the growth 
exponent is, as expected, close to 1 as these walks should scale like ideal chains. The 
thick walks behave very differently. The growth exponents for families of walks with 
very small thicknesses immediately jumps to 1.16 where it was fairly stable (+.01) for 
thicknesses t € [0.2, 1.0]. Therefore the effect of excluded volume is as immediate 
and profound on growth exponent as it is for knot formation: even the introduction of 
very modest thicknesses has a profound effect on shape, size and knotting. 
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Equilibria of elastic cable knots and links 


Abstract: We present a theory for equilibria of geometrically exact braids made of two 
thin, uniform, homogeneous, isotropic, initially-straight, inextensible and unshear- 
able elastic rods of circular cross-section. We formulate a second-order variational 
problem for an action functional whose Euler-Lagrange equations, partly in Euler— 
Poincaré form, yield a compact system of ODEs for which we define boundary-value 
problems for braids closed into knots or links. The purpose of the chapter is to present 
a pathway of deformations leading to braids with a knotted axis, thereby offering a 
way to systematically compute elastic cable knots and links. A representative bifurca- 
tion diagram and selected numerical solutions illustrate our approach. 


Keywords: elastic knots and links, cable knots, equilibria, variational problem, bifur- 
cation 
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12.1 Introduction 


Contact problems in the theory of elasticity have seen a surge of interest recently. They 
are challenging because of the one-sided constraints they introduce. Determining the 
topology and geometry of the contact set given the boundary conditions is probably 
the hardest part of any contact problem. Although progress has been made in some of 
the 1D continua problems [8], [5], [13], we are still far from a full understanding of the 
solutions of the Euler-Lagrange equations for general contact problems. The theory 
of elastic braids in [12] deals with a special type of (self-)contact problems of elastic 
rods involving only equality constraints. 

In this work we focus our attention on closed braids whose axes make a torus knot, 
i.e., so-called cable knots or links [2]. Such knots or links form naturally in (virtual) 
experiments with elastic torus knots/links if they are made to buckle under the inser- 
tion of twist and then allowed to go through a (multiple) self-intersection (the simplest 
example being the passage through a figure-8 with a single self-intersection). We do 
not intend to present in this chapter a global classification of all equilibria of knot- 
ted 2-braids; rather we explain a particular scenario of how they appear in a typical 
bifurcation diagram as a control parameter (such as the twisting moment) is varied. 
We make no assumptions on either the topology or the geometry of the contact set ex- 
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cept that it is closed on itself. We obtain the shape of the contact curve as part of the 
solution. 

In Section 2 we first recall the essential equations of the elastic 2-braid theory [12] 
specialised to the present case of (n, 2)-cable knots/links. Then, in Section 3, we for- 
mulate boundary conditions that smoothly seal the ends of the rods so that a knot 
or a link is formed and discuss a bifurcation diagram and numerical solutions before 
closing this study, in Section 4, with some concluding remarks. 


12.2 Theory of elastic braids made of two 
equidistant strands 


12.2.1 Equidistant curves, reference frames and strains 


Consider a pair of smooth curves r,(s) € R’, s € [0, Li], r2(o) € R?, o € [0, La], both 
parametrised by arclength, that serve as centrelines of two inextensible and unshear- 
able elastic rods of length L := L, and L>, respectively. We denote by t;(s) = dra(s) 
and t,(0) = dra(o) the unit tangents to each of the curves. We define the point-to- 
point squared distance function D2(s, a) = p(s, 0), where p(s, 0) = r2(0) - ri(s) 
is a chord vector connecting the two curves. We assume that there is a one-to-one 
mapping [0,L,] «~ [0,L2] : s + o(s) between the two curves such that D> is bi- 
critical at corresponding points, i.e., 202 (sg, o)|. = -2p(s, o(s)) - ti(s) = O and 


o(s) 
2Dr(s, 0) 6 = 2p(s, o(s)) - tp(o(s)) = 0. Then Do(s, o(s)) is constant, say equal 
O=O'S 


to A*, and the two curves are said to be at constant distance A. Moreover the chords 
p(s, o(s)) are orthogonal to the curve tangents at both ends. Because of the one-to- 
one mapping there exists acommon parametrisation for both curves. Taking arclength 
along the first curve as the common parameter, we can introduce the unit chord vector 
d,(s) = 4p(s, o(s)) = %[r2(o(s)) - r1(s)|. We shall henceforth write expressions like 
r>(s) instead of r>(a(s)). 

Along each curve we define two moving frames as follows (Fig. 12.1). We define 
u, := t,x d, so that the vectors t;, d; and u, form an orthonormal frame that we call 
the first braid frame. We also define an orthonormal material frame on the first rod: 
{t,, do, Vi} (do pointing in an arbitrary normal direction). The two frames differ by a 
rotation about tf; through an angle , measured from do to d;. Correspondingly, we 
define two moving orthonormal frames on the second curve r. The first one (called 
the second braid frame) is made up of the vectors f., d; and up := ty x d;, while the 
second one (called the second material frame) is {t2, d2, v2}. They differ by a rotation 
about the tangent vector t2 through an angle é2, measured from d, to dp. 
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Fig. 12.1: A braid made of two rods. Centrelines are drawn as thick blue curves while red helices 
show how the material twists. For clarity the rods are shown with smaller diameter. 


After choosing a coordinate system we may identify the orientations of the above 
four reference frames with elements of the group of orthogonal 3 x 3 matrices: 


Ri(s) :=(G(s), d;(s), B,(s)) ¢ SOG), 
B;,(s) =t;(s) x d;(s) = i(2 — i)u,(s) + |i - 1]v;(s), 


where i € {0, 1,2}, j € {1,2}, ij 4 02, 21. These define four skew-symmetric 3 x 3 
matrices in the Lie algebra so(3) as follows: 


@=RURy, Q=RERp, @=RURn, Q=RI,R», (12.2.1) 

where we have introduced the ‘hat’ isomorphism between skew-symmetric matrices 
0) —-W3 W2 

w=] w3 O  -wy, | in so(3) and axial (rotation) vectors w = (w1, W2, w3)7 in 


-W2 Wi (0) 
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R?. *! Here and in the following a prime denotes differentiation with respect to s. 
Thus we have defined four axial vectors w = (Wi, W2, W3)", Q = (Q;, OQ, 03)', w = 
(@1, W2, W3)7 and Q = (Q4,Q>, Q3)T, which are the rotation vectors of, respectively, 
the braid frames {t,, di, u,}, {t2, d,, uz} and the material frames {t;, do, vi} and 
{t2, dz, V2}. 

The orthonormal frames form a sequence under consecutive rotations about f1, 
d, and f2. Thus 


Ry = Roi Ri (£1), Ry = Ry, R2(y), Ro = Ry2Ri(E2), (12.2.2) 
where 


1 6) 6) 

R,(&) = exp(&e:)=|0 cosé -sing | ¢ SO(3), i=1,2, e, =(1,0,0)', 
0 sing cosé 
cosy O. sinn 

Roy) =exp(nez)= | O 1 0 | €SO(3), e2 = (0,1, 0)", 
-sinyn O cosy 


and 7 is the angle, about d,, from the first tangent, t,, to the second, t> (see Fig. 12.1). 
From Eqs. (12.2.1), (12.2.2) it follows that the rotation vectors of the material and braid 
frames are related as 


@ = RY(E,)@R (Es) + RY (ER (Er), (12.2.3) 
AO = RU(@R2(q) + RU)Ro(n), (12.2.4) 
Q = RY(E)OR,(E) + RT (E2)Ri(é2). (12.2.5) 


Owing to the inextensibility condition | ora = 1, the arclength parameter o along 
the second centreline satisfies 


o = \/(Aw,)2 + (Aw; - 1)2. (12.2.6) 


We express the angle 7 as a function of the components of w as 


siny = -Aw,/o, cosy =(1 -Aw3)/o, (12.2.7) 
and hence 
W1= (w - i) tany. (12.2.8) 


12.1 Throughout we adopt the notation that for any vector v € R? the sans-serif symbol v denotes 
the triple of components (v1, v2, v3)™ = (Vv: t1,v- di,Vv~- uy)T in the first braid frame. An excep- 
tion is however made for sans-serif rotation vectors, which are always triples of components in their 
corresponding frames. 
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Eqs. (12.2.3)—(12.2.5) allow us to express the strand strains (@1, W2, W3, Q1, Qo, 3) 
in terms of the variables (w1, W2, W3, &1, €2), which we shall call the braid strains. In 
much of the following it will be convenient to use Eq. (12.2.8) to eliminate w, in favour 
of the angle 7, which also has an intuitive physical meaning as braid angle. With either 
choice of variables we have an unconstrained description in which, by construction, 
the constant-distance constraint is automatically satisfied. 

We note that if w; = Oand w3 4 1/A, then 7 = 0 and the two strands are parallel 
(this condition does not restrict in any way the shape of the first centreline). When 
w3 = 1/A, and w, 4 0, the second strand becomes orthogonal to the first one (y = 
+m/2). If both w; = Oand w3 = 1/Atheno’ = 0, ie., the induced parametrisation of 
the second centreline is singular. The angle 7 is then not defined. We need to rule out 
this case. 

For a non-selfintersecting braid, tubes of radius A/2 around each of the centre- 
lines must not overlap. Locally this means that points connected by the bicritical 
chords must be closest-approach points of the centrelines. It may be shown that this 
requires n € (-7/2, 2/2) [12]. 


12.2.2 Equations for the standard 2-braid 


For the elastic energy we make the usual assumption of frame indifference, i.e., the 
energy is invariant under Euclidean motions. It will then depend only on the strains 
(and possibly arclength) and not, for instance, on the centreline r;. We have already 
assumed both rods in the braid to be inextensible and unshearable, so there is no 
elastic energy on account of stretches. Apart from this restriction we allow for arbitrary 
hyperelastic rods [1] and write 


ly 1B) Ii 
Gee i fi@)ds and U)= | fx(Q(o)) do = | fix(Q/o)o ds (12.2.9) 


for the elastic strain energy of the first and second rod, respectively, where f; and f> 
are the strand strain energy densities, and the explicit argument of Qin the o integral 
emphasises that Q is to be regarded as a function of arclength o of the second rod in 
this integral. 

Assuming fixed ends of the first strand, we add -F : [r,(L) —r(0)] = - ip F-t, ds 
as an isoperimetric constraint expression with F a (constant) Lagrange multiplier (to 
become the internal force in the braid). Also, by inextensibility of the second rod we 
require the length of the second strand 


L2 L L 
ie / do= / oss ip P@zayds (12.2.10) 
(0) 10) 10) 


to be constant, giving a second isoperimetric constraint. 
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We can express the arguments of the integrands in Eqs. (12.2.9) and (12.2.10) in 
terms of the braid strains (w, ¢, €2) and their derivatives by using Eqs. (12.2.3)—(12.2.5) 
and (12.2.6) and thus formulate a second-order variational problem for the reduced 
functional: 2s0(3)xR?xTS!xTS! > R, lw, w’, F, &1, €4, 2, &) = fil@)+fo(Q(o))+ 
ho -F- ti, 


L 
6 | tw, w , F, &1, 1, €2, €) ds = 0, (12.2.11) 
0 


for variations that keep the end positions and orientations of the rods fixed; his a 
(constant) Lagrange multiplier. 

The Euler-Lagrange equations are derived partly in Euler—Poincaré form and 
comprise 


(a) balance equations for the components of the overall braid force F = (F,, F2, F3)™ 
and braid moment M = (M,, M2, M3)" expressed in the first braid frame [12] 


F+wxF=0, (12.2.12) 
M +wxM+t,xF=0, (12.2.13) 

(b) the ‘constitutive’ equations 
M=€E,(), (12.2.14) 


where E(k) = & - (3) is the Euler-Lagrange operator for the variable ¢, 
(c) the phase equations for the twist angles of the rods 


Ez) =0, i=1,2. (12.2.15) 


Equations (12.2.12) and (12.2.13) are the moving-frame (advected) versions of the famil- 
iar balance equations F = 0, M +r, x F = 0 expressed in an inertial frame. It follows 
that |F| and F - M are first integrals. 

Equations (12.2.15) can be written as a set of four first-order equations by intro- 
ducing new variables (the strand torques) T;, i = 1, 2: 


ol ol 
a. Vee 
0; 06; 


If we eliminate the variable w, in favour of n by using 7 = n(w1, w3) given in 
Eq. (12.2.7), the reduced density | takes the form 


Tj (12.2.16) 


l= g(w2, W3, un n»Fi, &, Ets &, &), (12.2.17) 
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while Eq. (12.2.14) transforms into [11] 


_ on _ Acos?nfog  d (ag 

a OW, En(g) Aw3-1 E ds \ dn’) |’ (12.2.18) 
_ 08 

rags? (12.2.19) 
_ 0§ , On og Asinncosn/og d / og 

UT ue date ews Awse tk lon ds on (12.2.20) 


(cf. the equations for an elastic strip reduced to the centreline in [10]). 

We now specialise our theory to the case where the two strands are linearly elas- 
tic, uniform, isotropic, initially-straight rods with bending stiffness B; and B> and tor- 
sional stiffness C; and C2, respectively. We allow the strands to be intrinsically twisted 
with twist rates Wo; and Wo2. We call this special case the standard 2-braid. The den- 
sity g in Eq. (12.2.17) then takes the form 


& = Se(W2, W3,NN » £1, &) + h(1 - Aws)/ cos - Fi, (12.2.21) 


with the elastic energy density ge (= f; + f2), the sum of bending and torsional energy 
densities of the two strands, given by (using Eqs. (12.2.3)—(12.2.5)) 


1 atk ie Box. x ce Co (x , 2 
for5 B13 + W3)+ Gr (a + 03) + C1(@1 - Wo1)* + = (01 - o'wo2) = 


- 
2 


2 
+1 | (ws i) tan &~ wo] + 


5 2 A 
ral wae (2 -&) + 2éoa (4 -&) ewes ies] \ (12.2.22) 


cos y 


2 
rey eae + 


(w3 - sin? var _ 


{Bi(w} + 03) + Ba] (1 - Aw3) cosy 


1-Aw3 


This gives a system of 13 ODEs in (F,, F2, F3, M1, Mo, M3, n, W2, W3, &1, T1, &2, T2): 
Eqs. (12.2.12), (12.2.13), (12.2.16), (12.2.18), (12.2.19) and (12.2.20). The last three in fact 
contain two algebraic equations because g does not depend on w, and W3; but these 
can be turned into ODEs by differentiation (and the algebraic equations used to solve 
for y and h). Also note that since g does not depend on the phase angles € and £2, 
the strand torques T; are constants. 


12.2.3 Kinematics equations 
Reconstruction of the centreline of the first strand requires solving for the tangent t, 


and integrating this to get r;. We choose a parametrisation of the first braid frame {t, 
d,, u,} in terms of four Euler parameters (or quaternions) q = (qo, q1, 42, q3) subject 
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to the normalisation condition q) + qj + q3 + qj = 1 [6] and write 


+91 - 93-43 24192 - 24093 
ti= | 24192+2q0q3 |, 41 = | a3-4t+43-43 |, 
24143 - 24042 24243 + 24041 


24193 + 24042 
Ur= | 29293-24091 
9-43 -95+9 


These Euler parameters, unlike Euler angles, give a singularity-free description of ar- 
bitrary rotations in space and are therefore convenient for numerical computations. 
The kinematics equations are obtained by differentiating the above vectors and using 
Eq. (12.2.1): 


qo O -W, -W2 -W3 qo 
Ata} _ljor. 0 3 -w2)] a (12.2.23) 
ds|qo| 2]}w2. -w3 O wy }} qo’ ~ 
q3 W@3 Wr, -W, O q3 


where we recall from Eq. (12.2.8) that w, = (w3- x) tan 7. It is straightforward to 
verify that q - q is a first integral of this system of equations. To find the centreline r; 
we solve Eq. (12.2.23) in conjunction with the equation r, = t,. The second centreline 
is then given by rp =r, + Adj. 


12.3 Numerical solution 


In this section we present results obtained by numerically solving boundary-value 
problems for the standard 2-braid equations developed in Section 2 using the contin- 
uation and bifurcation code AUTO [3]. AUTO solves ODEs using collocation and uses 
pseudo-arclength continuation to advance solutions as a control (or bifurcation) pa- 
rameter is incremented. The code is also able to detect pitchfork or other bifurcation 
points along a solution branch where this branch intersects another solution branch 
and to switch to and compute this second branch. In this work we choose the strand 
torque as a bifurcation parameter. 

Here we consider only the closed braid problem where the two rods form a torus 
knot or link. We define a linking number to distinguish between topologically different 
types of such 2-braid solutions by following Fuller’s extension of the concept of a link- 
ing number to a (not necessarily closed) ribbon [4]. By a ribbon is here meant a curve 
with a field of normal vectors of constant length. First note that the contact curve of 
our knot 5[ri(s) + r2(s)] = ri(s) + 4d;(s), s € [0, L], is a (smoothly) closed curve and 
the director $.d,(s) is an orthogonal vector defined for all s € [0, L]. It therefore forms 
a non-closed ribbon. The linking number of this ribbon is defined as Lk = Tw + Wr, 
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where Wr is the writhing number of the contact curve and Jw = + . tw(s) ds is the 
total twist (the twisting number) of the braid with twist rate tw. To have a closed single 
centreline, Lk has to be semi-integer, Lk = n/2,n ¢€ Z. Alternatively, if we think of 
the ribbon as extended in both +4.d, directions, we may say that, for our knots, this 
ribbon is one-sided (like a MGbius strip [10]) and its edge (given by the centreline of 
the rod) is a single knotted curve. 

We start from an analytical solution (two linked equidistant circular rings) and 
perform continuations mimicking the cutting, twisting and resealing of the rings to 
obtain different torus knots (and links) [12] and eventually, after torsionally buckling 
these knots/links, cable knots/links. The braid axis (contact curve) of this starting so- 
lution is an unknot. We compute solutions with different linking numbers for which 
we derive boundary conditions that ensure smooth closure of the braid into a knot (for 
semi-integer linking number of the braid) or a link (for integer linking numbers). 

In addition to the 13-dimensional system of ODEs derived in Section 12.2.2 we solve 
the 7 kinematics equations for q and r, of Section 12.2.3, so as to be able to apply dis- 
placement boundary conditions and plot solution shapes, and Eq. (12.2.6) to compute 
the second arclength. Integration of this last equation allows us to impose the total- 
length constraint Liot := L + Lz = const., the natural constraint to consider for knot 
solutions. Since q - q is an integral of the kinematics equations, the quaternion nor- 
malisation condition |q| = 1 can simply be imposed by choosing boundary conditions 
for the q; that satisfy it. 

All numerical runs are for elastic parameters B; = By = 1, C,/B, = C2/Bz = 
1/(1+v), where Poisson’s ratio v = 0.3, and in all cases we set Wo; = Wo2 = 0. Givena 
solution of the 21D system of ODEs we can compute other properties such as the nor- 
mal contact pressure p or the curvature x; of the contact curve (see details in [12]). Here 
we make the assumption of frictionless hardcore contact in which the only distributed 
force acting between the two contacting strands is the normal pressure directed along 
the chord vector d,. First integrals are monitored throughout to keep track of the nu- 
merical accuracy of solutions. 


12.3.1 Torus knots 
12.3.1.1 Boundary conditions 


To compute knot solutions of total length Liot we apply the following boundary con- 
ditions 
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x1(0) = 0, (12.3.1) AA OA, (12.3.13) 
yi(0) = 0, 232) YH =o (12.314) 
z1(0) =0, (12.3.3) z1(L) = 0, (12.3.15) 
qo(0) = 0, (12.3.4) qo(L) = q3(L), (12.3.16) 
qi(0) = -1/V2, (12.3.5) ql) = sae (12.3.17) 
q2(0) = -1/V2, (12.3.6) 4qo(L)qi(L) = sin n(L), (12.3.18) 
q3(0)-0, (237) HH = 10), (12.3.19) 
£(0) = 0, (12.3.8) [M2(0) - Byw2(0)]/Bz = -w2(L), (12.3.20) 
£(0) = 0, pas At Ast) Leos n(0) = cos ni een 
T,(0)= tio, — (12.3.10) Hes mae 
T7(0)=t20, (12.3.1) ee he as 


o(0) = 0, (12.3.12) 


where f19 and tz9 are constants. Since for a knot the two strands are part of a single 
closed rod, it only makes good sense to take t29 = —t19 and also Bp = By, Cp = Cy. 

Conditions Eqs. (12.3.1)-(12.3.7), (12.3.13)—(12.3.21) ensure smooth closure of the 
braid into a doubly-covered ring, smooth here meaning continuity up to second 
derivatives of r;, i.e., curvatures. Conditions Eqs. (12.3.13)—(12.3.15) place the end of the 
first centreline r; at the beginning of the second centreline rz, both strands thus form- 
ing a single closed rod. Conditions Eqs. (12.3.16) and (12.3.17) imply d,(L) = -d,(0). 
Condition Eq. (12.3.18) guarantees that t;(L) makes an angle n(L) with t,(0) of the 
same strand. Together with condition Eq. (12.3.19) this ensures matching of the tan- 
gents: the tangent at s = L of the first strand aligns with the tangent at s = 0 of 
the second strand, and vice versa. Conditions Eqs. (12.3.19)—(12.3.21) are equivalent to 
Q4(0)/0'(0) = w,(L), Q2(0)/o (0) = —w>(L) and Q3(0)/0 (0) = -w3(L), which ensure 
that the curvatures at the end of the first strand match the curvatures at the beginning 
of the second strand. 

The total number of boundary conditions is 23 and the three free parameters re- 
quired for solution branches are L, L> and tio. Note that the material frames of these 
knot solutions will in general not be closed. This is not different from similar studies of 
closed single rods (see, e.g., [9, 7]). Material closure requires ¢,(0)+é(L) = 2 (mod 27) 
and can be achieved by inserting the right amount of twist by adjusting t1 (= -fy9). 
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Fig. 12.2: Partial bifurcation diagram for the (7, 2) torus knot with the first four bifurcation points 
(marked with solid black circles) and bifurcating branches. A = 0.0051364907419, Liot = 1.98889. 
Empty circles mark particular cable knots whose shapes are shown Separately in the following fig- 
ures. 


12.3.1.2 Examples of cable knots 


Cable knots [2] can be obtained by inserting twist into the torus knots computed using 
the above boundary conditions thereby forcing them to buckle (bifurcate). The bifur- 
cation diagram in Fig. 12.2 shows how cable knots appear along bifurcating solution 
branches. The figure presents a (T, F) diagram, where T := tio = T; = —T2 is the com- 
mon strand torque and F = |F| is the magnitude of the constant braid force as a char- 
acteristic solution measure. The diagram is for the (right-handed) (7, 2) torus knot. 
The trivial branch, representing unbuckled knots with unknotted braid axis, makes 
an infinite v-shaped curve (only part of this branch, marked 1 - 2 - 3 - 4, is shown 
in Fig. 12.2). The solutions along the v-branch are D7-symmetric with the cenral axis 
aligned with the force vector. We shall refer to them as ‘flat’ solutions because in many 
ways they behave like twisted planar circular isotropic rods (which have linear F-T re- 
lationship for branches of planar equilibria [7]). This is despite the fact that neither 
the contact curve of the braid nor the centreline of the rod is a plane curve. 
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Fig. 12.3: Figure-8-shaped (11, 2) torus knot. T = -2.10752, F = 171.50131 (a) in Fig. 12.2). 4 = 
0.0051364907419, Liot = 1.98889. 
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Fig. 12.4: Buckled right-handed cable knot formed by knotting a right-handed (13, 2) torus knot 
into a right-handed trefoil. T = -9.89105, F = 368.075 ((b) in Fig. 12.2). A = 0.0051364907419, 
Ltot = 1.98889. 
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Fig. 12.5: Buckled right-handed cable knot formed by knotting a right-handed (7, 2) torus knot 
into a right-handed (4, 3) torus knot. 7 = -10.97932,F = 634.5698 ((c) in Fig.12.2).A = 
0.0051364907419, Liot = 1.98889. 
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Fig. 12.6: Buckled right-handed cable knot formed by knotting a right-handed (7, 2) torus knot into a 
right-handed (5, 4) torus knot. T = -15.51471, F = 942.989 ((d) in Fig. 12.2). A = 0.0051364907419, 
Ltot = 1.98889. 
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There exist critical torque levels T, corresponding to bifurcation points 
1,2,3,4,...in the diagram, where new branches of more warped (buckled) solutions 
begin. This sequence of buckling modes is again similar to the behaviour of a single 
closed elastic rod (cf. Fig. 3 in [7]). Figure 12.2 shows branches for the first four bifur- 
cating modes, with the bifurcation points marked by solid black circles. As we move 
further away from the right side of the v-branch, shapes become more warped and at 
some point begin to self-overlap. The braid axis exhibits a multiple self-intersection 
depending on the mode, becoming knotted for modes 2 and higher. Continuing along 
the branch we come to an interval where the shape becomes free of self-overlapping, 
see points a, b, cand d in Fig. 12.2 (this may not happen if the rod is too thick). These 
solutions are cable knots, examples of which are shown in Figs. 12.3-12.6. 

The braid axis of the first mode shape in Fig. 12.3, going through a single figure-8- 
type self-intersection, remains unknotted; we present it here only for completeness of 
the picture. We see that the right-handed (7, 2) torus knot becomes a (11, 2) torus knot 
after self-crossing. The linking number of the shape shown is 3.5 + 2 = 5.5. Each of 
the curves corresponding to the various modes ends at another v-branch (only parts 
of which are shown in Fig. 12.2). These represent multi-covered ‘flat’ solutions also 
having their own bifurcation points. As they are self-overlapping for modes higher 
than one, we do not explore them further. 

Allsolutions in Figs. 12.3-12.6 have been computed by continuation from the (7, 2) 
torus knot and have the same total length Li, (= 1.98889). (The shape in Fig. 12.6 
has a small overlapping domain because the chosen value of A is just too large.) In 
particular, the trefoil cable in Fig. 12.4 is obtained by continuation of a right-handed 
(7, 2) torus knot that after a triple self-crossing becomes a (13, 2) torus knot tied into 
a right-handed trefoil. Thus, the torus knot (inside the tubular neighbourhood of the 
companion trefoil knot) has linking number 3.5 + 3 = 6.5. The corresponding linking 
numbers for the cable knots in Figs. 12.5 and 12.6 are 7.5 and 8.5, respectively. 

The set of all knot solutions breaks down into two components. A disjoint bifurca- 
tion diagram, the mirror image of the present diagram under reflection in the F-axis, 
consists of all solutions connected to the left-handed (7, 2) torus knot (hence, the left- 
handed trefoil). 

The contact pressure profile is drawn in white for s € [0, L] and in black for 
s €[L, Lot] in Figs. 12.3-12.6. This allows one to see the differences for both contact- 
ing strands. The profile differences are larger for thicker rods. We observe that for all 
solutions shown the contact pressure is non-negative and increases with the mode. 


12.3.2 Torus links 
12.3.2.1 Boundary conditions 


To compute link solutions we apply the following boundary conditions: 
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x1(0) = 0, (12.3.24) 
yi(0) = 0, (12.3.25) 
Z1(0) = 0, (12.3.26) 


xi(L) = x1(0), (12.3.36) 
go(0)=0, —(12.3.27) AO oS 
ae 7 3.28) Z1(L) = z,(0), (12.3.38) 
q2(0) = -1/v2, - qo(L) = q3(L), (12.3.39) 

(12.3.29) qi(L) = q2(L), (12.3.40) 
q3(0)=0, — (12.3.30) go(L) + qi(L) =-sin(®/2-7/4), —— (12.3.41) 
(0) =0, (12.3.31) W2(0) fixed, (12.3.42) 
£(0)=0, — (12.3.32) w2(L) = w2(0)cos P+ w3(0)sin®, —_(12.3.43) 

o(L) = La, (12.344) 


T,(0) =ty9,  (12.3.33) 
T2(0) =t29, (12.3.34) 
o(0) = 0, (12.3.35) 


where f10 and f29 are constants and w>(0) is fixed at whatever value the starting solu- 
tion has. 

The q; conditions at s = Oimply that ¢, is aligned with the y axis, d, is aligned with 
the x axis and uw, is aligned with the —z axis. Conditions Eqs. (12.3.39) and (12.3.40) 
ensure alignment of the tangents ¢,(0) and ¢,(L). ® is the angle between d,(0) and 
d,(L) in the plane normal to these tangents. Closed links have ® = +27n, where n is 
a non-negative integer. By using parameter continuation in ® solutions with different 
linking number Lk = +n may be obtained, positive linking numbers corresponding to 
right-handed and negative linking numbers corresponding to left-handed links. 

This gives a total of 21 conditions for a 21-dimensional system of equations. To 
compute branches of solutions for a given link we may vary one of the parameters, for 
example fo or A. For equal twisting of the two strands (in the same direction), t2) may 
be taken equal to -t19. Note that closure of the material frames of these link solutions 
requires é;(L) = &;(0) (mod 27), i = 1, 2, which will not in general be satisfied. 


12.3.2.2 Examples of cable links 


Similar to how we computed cable knots, we first compute a trivial branch of ‘flat’ torus 
links with unknotted contact curve. Then, following the branches of post-buckled 
solutions of different modes, we observe knotting of the braid axis by multiple self- 
intersection, thus forming what we call cable links. Figs. 12.7-12.9 present, respec- 
tively, mode-2, -3 and -4 cable links with equal-length components (L, = L2). The 


Equilibria of elastic cable knots and links ——= 273 


Fig. 12.7: Buckled right-handed cable link formed by knotting a left-handed (6, 2) torus link into a 
right-handed (3, 2) torus knot. 7; = -T2 = -11.46623, F = 364.061,A = 0.016016913268, 
Ly = Lo =1. 


shape in Fig. 12.7 is obtained by taking a left-handed ‘flat’ torus link (6, 2) and fol- 
lowing the second bifurcating branch. After self-intersection, the axis of the braid be- 
comes a right-handed trefoil (a companion knot). As can be verified directly from the 
image in Fig. 12.7, the two strands are not linked inside a tube imagined around the 
knotted braid axis. The linking number of this link equals 3. A similar procedure ap- 
plied to a ‘flat’ torus link emanating from a third mode bifurcation, results in a cable 
of a right-handed (4, 3) torus knot (Fig. 12.8). The two components form a 1-link inside 
an imaginary tube around the knotted axis of the braid. The linking number of the link 
is 9. The fourth mode bifurcating branch of the initial ‘flat’ torus link (10, 2) leads toa 
cable of a right-handed (5, 4) torus knot (Fig. 12.9) and the two strands are not linked 
inside the knotted tube. The linking number of this link is 15. 

In all Figs. 12.7-12.9, the contact pressure profiles are displayed as functions of 
arclength s along the first strand, in white for the pressure on the first strand and in 
black for the pressure on the second strand. Note that for all three examples the con- 
tact pressure changes sign, which means that the physical realisation would require 
a mechanism (e.g., glue or a clamp) to keep the strands together. 


12.4 Concluding remarks 


We have shown how the simplest cable knots and links emerge as equilibrium config- 
urations of closed braids made of two equidistant elastic strands. Cable links present 
an obvious generalisation of cable knots when there is more than one component. A 
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Fig. 12.8: A cable of a right-handed (4, 3) torus knot. 7) = -T2 = -14.49330, F = 633.545, A = 
0.006, L1 =L2 =1. 


Fig. 12.9: Buckled right-handed cable link formed by knotting a left-handed (10, 2) torus link into a 
right-handed (5, 4) torus knot. 7; = —Tz = -14.63829, F = 946.310, A = 0.002, Li = Lz =1. 
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further natural extension of this mechanical problem would be to increase the num- 
ber of strands in the braid, which requires the development of a multi-strand elastic 
braid theory. Although this calls for a more sophisticated technique to account for 
equidistant constraints on multiple strands, we do not expect a significant difference 
in qualitative geometrical properties of mechanical equilibrium solutions for thin elas- 
tic rods. 
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Groundstate energy spectra of knots and links: 
magnetic versus bending energy 


Abstract: In this paper we review recent results on the groundstate energy spectra 
of magnetic knots and links and compare these results with new results on bending 
energy of tight knots and links obtained by using RIDGERUNNER data on curvature. Re- 
markable similarities between the two systems are found. Comparative analysis be- 
tween magnetic and bending energy at groundstate energy level shows that informa- 
tion based on bending energy provides a very good proxy for magnetic end-states. 


Keywords: Knots, links, energy spectrum, constrained minimization, magnetic relax- 
ation, bending energy, topological fluid mechanics. 
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13.1 Introduction 


In recent years much progress has been done in applications of knot theory to mathe- 
matical physics, from classical to quantum field theory, and in physical and biological 
sciences as well. This has led mathematical research to explore new territories at the 
cross-road of several, different disciplines. One interesting problem here is the study of 
minimum energy states of physical knots and links subject to continuous relaxation 
(through diffeomorphisms) of some energy functional. Magnetic relaxation of knots 
and links embedded in an ideal fluid provides indeed a prototype example of mini- 
mization useful to explore and understand similar features present, for instance, in 
elastic systems. Here we present a brief review of results on the groundstate energy 
spectra of magnetic knots and links (published in J. Phys. A: Math. & Theor. [20]) that 
sheds light on similar aspects when we consider elastic, rather than magnetic knots 
and links. Indeed proof of how good this analogy can be is given by comparing, as 
we do here, those results with new results based on bending energy estimates of tight 
knots and links obtained by using data readily available from RIDGERUNNER [1]. 

The material is presented as follows. In Sec. 13.2 we consider magnetic knots and 
links as tubular embeddings in ideal magnetohydrodynamics and introduce basic def- 
initions. The prototype problem of magnetic relaxation under constraints is discussed 
in Sec. 13.3. A solution to the problem of magnetic relaxation of (zero-framed) knots 
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and links is presented in Sec. 13.4. By using RIDGERUNNER data the groundstate mag- 
netic energy spectra of the first 250 prime knots and 130 prime links are presented 
(Sec. 13.5). Similar spectra for the bending energy of tight knots and links are pre- 
sented in Sec. 13.6. A comparative analysis of these results is made in Sec. 13.7. Finally, 
conclusions are drawn in Sec. 13.8. 


13.2 Magnetic knots and links in ideal conditions 


We consider magnetic knots and links in an ideal, incompressible, perfectly conduct- 
ing fluid in S? (i.e. R? U {co}, simply connected). Let u = u(x, f) be the fluid velocity, 
smooth function of the position vector x and time t, with V -u = 0 in S? andu = Oat 
infinity. The magnetic field B = B(x, f) is frozen in the fluid and has finite energy, that 
is 


Be {V-B=0, 0:B=V~(uxB), L2-norm}. (13.2.1) 


A magnetic knot is a magnetic flux tube prescribed by the knot type and the mag- 
netic field B, defined on a regular tubular support J(K) centered on K. We assume K 
to be a C*-smooth, closed loop (i.e. a submanifold of S* homeomorphic to S*), sim- 
ple (i.e. non-self-intersecting) and parametrized by arc-length s. The tube J = K @ 8, 
given by the cartesian product of K and the circular disk S, is centered on the knot, 
whose total length is L = L(K) (hence s ¢€ [0, L]), local radius of curvature p > 0, and 
cross-sectional area A = 7R? of radius R > 0. 

Since the magnetic knot is a physical tube, it is useful to introduce the volume 
V(J), the magnetic flux ® and the magnetic energy M. The total volume is given by 
V = V(5) = 7R?L, with tubular boundary oJ a magnetic surface, i.e. 


supp(B) := J(K) , B-v, =Oondo7, (13.2.2) 


where v, is a unit normal to 07. The existence and regularity of non-self-intersecting 
nested tori, support of the magnetic field inside J, is guaranteed by the tubular neigh- 
borhood theorem [22], provided p = R all along K. The magnetic flux ® is defined 
by 
@= / B-id’x, (13.2.3) 
8 


where now ? is the unit normal to 8; the magnetic energy M is given by 


M=5 /ipieex. (13.2.4) 


V(T) 
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13.3 The prototype problem 


For a magnetic knot, whose field is confined to a single tube of signature (V, ®), 
the combined action of magnetic stresses and Lorentz force induces the field lines 
to shrink like elastic bands, by shortening the knot, while conserving volume and 
flux [12]. Magnetic energy gets gradually converted into kinetic energy, and eventu- 
ally dissipated by viscosity or other dissipative effects, if present. As the relaxation 
progresses, the average cross-section increases proportionately, and the tubular knot 
becomes thicker and tighter, until knot topology prevents any further adjustment: the 
final state is ultimately reached when the relaxation comes to a complete stop (see 
figure 13.1). During this process the knot is also gradually deformed by the action of 
a signature-preserving flow (through diffeomorphisms), that governs the relaxation 
from the initial configuration. Since the tight configuration of the end-state resembles 
that of an ideal knot of platonic features [23], magnetic relaxation provides physical 
mechanism to investigate optimal geometric properties of ideal knots. 

Let (r, 9p, s) denote an orthogonal, curvilinear coordinate system centered on K 
(see [11]); r € [0, R] and Yp € [0, 27] are the radial and azimuthal coordinates in the 
cross-sectional plane of 8, with origin O at s = 0 and 9p = O given by the direction 
of the principal normal to K at O. The metric is orthogonal, with scale factors hy = 1, 
hg, = r’, hs = 1-crcos 9, where c = c(s) is curvature, 


Ss 


9 9x.) = 9e- | r(8)a8, (13.3.1) 
10) 


and T = T(s) torsion. Orthogonality is ensured by eq. (13.3.1), which provides the neces- 
sary correction to the standard azimuthal angle by the torsion contribution (see details 
in [11], Sec. 3). The results presented here are derived by using this metric. 
The magnetic field B may be decomposed into meridian and longitudinal compo- 
nents, that is 
B = (0, Bg, (r), Bs(r)) , (13.3.2) 


and in general we assume that the longitudinal field is far greater than the meridian 
field, ie. Bs >> By,. This is consistent with the usual definition of twisted flux tube, 
whose field lines wind around the knot axis in the longitudinal direction. By using 
the solenoidal condition V - B = 0, the magnetic field can be expressed in terms of 
poloidal (meridian) and toroidal (longitudinal) fluxes ®p and ®r, i.e. 


_(, 1d@®p 1 d®r op oy 
E (07 dr ’ 2ar dr )s (0.32 OoR |’ (13.3.3) 


where the total field is given by the sum of an average field plus a fluctuating field with 
zero net flux, in terms of the flux function p = W(r, 9p, s). The twist h = ®p/@®r of the 
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hb @ ’&~ 
®, : 


Fig. 13.1: Ideal relaxation of a magnetic trefoil knot and a Hopf link. 


field lines provides the magnetic field framing given by (27)! times the turns of twist 
required to generate poloidal field from toroidal field, starting from Mp = 0. 

According to the process described above, knot topology dictates a lower bound 
on the relaxation of magnetic energy M, which must be bounded from below by a 
minimum Mj, > 0, that on dimensional grounds is given by (see [13]) 


Min = m(h)@?V-"3 , (13.3.4) 


where m(h) is a positive, dimensionless function of the internal twist h. Of particular 
interest is the value of h for which m(h) is minimal (m,,;,). A fundamental problem is 
this: 


Problem 1 ([14]). Determine m,j, for knots of minimum crossing number 3, 4, 5, .... 


If Cmin denotes the topological crossing number of the knot and h = 0 (a condition 
referred to as zero-framing), one can prove the following result: 


Theorem 1 ([16]). Let K be a zero-framed magnetic knot with signature {V, ©}. We 
have m(0) = (2/7)"/3 cmin; hence 


1/3 
2 = 
Mrnin = (2) Coin? V-? (13.3.5) 


For a signature-preserving flow, eq. (13.3.5) establishes a correspondence between 
minimum energy levels and topology. However, Mmin « Cmin is a rather loose result. 
From a direct inspection of the knot table (see, for instance, the standard tabulation 
in [21]) with the exception of the trefoil and the 4-crossing knot, for all other values of 
Cmin > 4 there are several distinct knot types for each given C,,;,, and the number gets 
exponentially large for increasing values of Cin (2 for Cmin = 5, 3 for Cmin = 6, 7 for 
Cmin = 7, 21 for Cmin = 8, and so on). Hence, the question is to determine whether a 
one-to-one relationship between energy minima and different knot types of same Cpin 
may exist or not. 
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13.4 Relaxation of magnetic knots and constrained 
minima 


To explore this problem let us consider the relaxation of a magnetic flux tube in some 
generality. Let V; = ar7L be the partial volume of the tubular neighborhood of radius 
r; the ratio of the partial to total volume is given by V;/V(J) = (r/R). Now, let f(r/R) be 
a monotonically increasing function of r/R; for example f(r/R) = (r/R)”, with y > 0; 
y = 2 defines the standard flux tube. A detailed analysis of the relaxation of mag- 
netic flux tube with twist is done in [11]. By using the orthogonal, curvilinear system 
(r, 9p, Ss) and the magnetic field decomposition given by (13.3.3), standard minimiza- 
tion of (13.2.4) is carried out and under the periodicity of 9p and s, subject to these 
assumptions: 

(i) {V, ®} is invariant; 

(ii) the circular cross-section is independent of s; 

(iii) p is independent of s; 

(iv) the knot length is independent of h. 
We have: 


Theorem 2 ([11]). Let K bean essential magnetic knot with signature { V, ®} and mag- 
netic field given by (13.3.3). Constrained minimization of magnetic energy yields 


27%*2 2 
M = Dd’, 13.4.1 
(a 2E* eed) 


where L* is the minimal tube length of the tight knot. 


For a standard flux tube (y = 2), (13.4.1) reduces to 
77) 
* L mh? 2 
M = (Sr + | @. (13.4.2) 


This result is equivalent to the eq. (4.2) of [7] (coefficients left undetermined). Note that 
because of the constraints, for any given knot family we have that (M*)c,, > Mmins 
where angular brackets denote averaging over the number of knots of the same Cypin 
family. 

In order to investigate the relation between energy and knot topology, let us refer 
to standard flux tubes; it is useful to rewrite eq. (13.4.2) in terms of ropelength, a useful 
measure of knot complexity [4]: this is defined by A = L*/R", where L” is the minimal 
length and R* the maximal cross-sectional radius of the knot in tight configuration. In 
the case of the unknot, the least possible value of A (say Ao) is that given by the tight 
torus; hence A = Ao = 27. By using V = mR L* = cst., and after some straightforward 


algebra, we have 
ys 4/3 4/3 2 
M’ = ( aera HEN )orvee (13.4.3) 


In2/3 «2/3 
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Fig. 13.2: Influence of twist h on the energy function m(A, h), plotted against ropelength A, accord- 
ing to eq. (13.4.4). The absolute minimum is given by the tight torus, for which A = Ao = 2mand 
Mo. = 2.70 (from [20]). 


By comparing (13.3.4) and (13.4.3), and under the above assumptions, we can state that 


4/3 4/3 2 
nae. 240 


PCa aL (13.4.4) 


showing the explicit dependence of minimum energy on ropelength and framing 


13.5 Groundstate magnetic energy spectra 


Let us first investigate the minima Mpin = Mpin(h) by plotting (13.4.4) against A for 
h=0,1,2,3,... (see figure 13.2). The absolute minimum m, corresponds to the zero- 
framed unknot (tight torus), given by h = O and A = Ao = 2m: mo = (2m77)"3 = 2.70. 
The groundstate energy of zero-framed flux tubes provides the absolute minimum en- 
ergy level; m(h) remains a monotonic increasing function of A for h < 2: at Aj = 21 we 
have m(h = 1) = 4.05 and m(h = 2) = 8.11. For h = 2 the energy minima are attained 
at h = A/7; thus, by substituting the optimal value A = zth in (13.4.4), we have 


Mmin(h) = 5 1! ah? (hed). (13.5.1) 


For h > 2 (and A = Ao) the functional dependence of m(h) on A ceases to be mono- 
tonic. The same h*/? power-law of eq. (13.5.1) was also found by [7] (p. 206, eq. 4.15, 
by scaling arguments). 

The minimum energy spectra of the first prime knots and links is determined by 
setting h = 0 in (13.4.4) and by using ropelength data (Ax) of each knot/link type K ob- 
tained by RIDGERUNNER, a tightening algorithm developed by [1]. A particularly simple 
expression is obtained by normalizing m(Ax, 0) with respect to the minimum energy 
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Fig. 13.3: Magnetic energy spectrum m = m(#x) of tight knots (top) and tight links (bottom) plot- 
ted against the knot/link number #,, given by the position of the knot/link K listed according to 
increasing value of ropelength Ax = A(#x) (from [20]). 


value mo of the tight torus; thus, we have 


4/3 
aK) = a) _ (35) (13.5.2) 


that gives a one-to-one relationship between minimum energy level and knot rope- 
length. Since the relation Ax = A(K) is not known analytically, it must be recon- 
structed from numerical data. We take Ax = A(#x), where #x denotes the position of the 
knot/link K listed according to increasing values of ropelength given by RIDGERUNNER. 
Hence, instead of tabulating energy levels as function of the knot/link position given 
by standard knot tabulation, by taking Ax = A(#x) we plot m = m(#x), according to in- 
creasing ropelength data. The energy spectra are shown in figure 13.3 for the first 250 
prime knots up to 10 crossings (top diagram) and 130 prime links up to 9 crossings 
(bottom diagram). Remarkably, magnetic energy levels of knot and link types seem 
to follow an almost identical logarithmic law given by the best fit curve shown in the 
plots. 
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Fig. 13.4: Bending energy spectrum é = é(#,) of tight knots (top) and links (bottom) plotted against 
the knot/link number #x, given by the position of the knot/link K listed according to increasing 
values of ropelength Ax = A(#x). 


13.6 Bending energy spectra 


It is interesting to compare groundstate magnetic energy spectra with the correspond- 
ing bending energy spectra obtained by considering bending energy in place of mag- 
netic energy. Since magnetic relaxation is driven by the Lorentz force, that is mainly 
a curvature force, computation of the elastic energy due solely to curvature (bending 
energy) provides an interesting comparison. Bending energy is defined by 


Ey = ; $ K,[c(s)]? ds , (13.6.1) 
e 

where K; is bending rigidity and c(s) is local curvature. By normalizing this quantity 
with respect to the reference value E, = 1K,/R* = K,2‘/?7°/? of the tight torus, we 
have the normalized bending energy given by 

3 Eb _ gelc(s)? .ds 

Eo 24/3 775/3 

By using curvature data of tight knots obtained by RIDGERUNNER, we can easily plot the 
energy spectrum é = é(#,) according to increasing ropelength data. The correspond- 
ing energy spectra are shown in figure 13.4. Remarkably, and similarly to the magnetic 


(13.6.2) 
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Fig. 13.5: Ratio of magnetic to bending energy x = m/@ of tight knots (top) and tight links (bottom) 
plotted against the knot/link number #x, given by the position of the knot/link K listed according to 
increasing value of ropelength Ax = A(#x). 


case, bending energy levels of knot and link types seem to follow an almost identical 
logarithmic law given by the best fit curve shown in the plots. 


13.7 Magnetic energy versus bending energy 


Let us compare the energy values by taking the ratio of magnetic to bending energy 
x = m/é and plot for comparison y = y(#x) according to increasing ropelength data. 
Results are presented in figure 13.5. As we see, with a few exceptions, ratios tend to 
level up around a constant value that for sufficiently large cin for tight knots is given 
by x = 21.62 and for tight links is given by ¥ = 21.42. Thus, on average, for complex 
topologies we see that information on bending energy is proportional to magnetic en- 
ergy. This is actually in good agreement with expectations: since magnetic relaxation 
is driven by the Lorentz force, which to a first approximation is a curvature force, it is 
not so surprizing to discover that relaxed magnetic states are, on average, proportional 
to bending energy end-states of tight knots and links. Departure from average values 
is greater at lower Cyin, Where, probably, the constraints imposed by the assumptions 
affect the most the results. 

Let us now re-examine a common feature of the plots of figures 13.3-13.4. The 
curves dotted by circles results from a linear fit made over each Cy,;,—family, while 
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the continuous curve is the best-fit interpolation over all available data. To the first 
decimal digit, we find that best-fit interpolations follow an almost identical logarith- 
mic law, given by 

m(#x) =aln#x+b, (13.7.1) 


with a = 4.5, b = 10.5 for magnetic knots and a = 4.5, b = 9.3 for magnetic links, 
and similarly 
e(#x) =aln#g+b, (13.7.2) 


with a = 0.2, b = 0.6 for elastic knots and a = 0.2, b = 0.5 for elastic links. These 
unexpected results are quite remarkable and call for some justification. 

Ropelength is certainly an increasing function of topological complexity (given by 
Cmin), also because an increasing number of crossings implies an increasing minimal 
length necessary to tie a flux tube into a knot or a link. Results on ropelength bounds 
[4, 6, 8, 9] show that 

O(erhn) < Ax < O(Cmin N° Cin) (13.73) 


min 
where O(-) denotes order of magnitude. From (13.5.2) we have that m(#x) « [A(# wl! 3, 
by combining this with (13.7.2), we have 


AGI? « aln#g +b. (13.7.4) 


Now, if we assume that the number of knots grows exponentially with cin (a plau- 
sible assumption), then #x ~ C‘™™ for some constant C. Hence, by (13.7.4) we have 
AGRI? & Cmin» OF 

Ax) x 3/4 (13.75) 


min? 
a result that, if not true in full generality, is certainly in good agreement with the lower 
estimate given by (13.7.3). Furthermore, let us set (for simplicity) V = ® = 1 in (13.3.5), 
and define 


—_ Mni 1 
M(Cmin) = AE = = Cmin « (13.7.6) 
We can then relate (13.3.5) to (13.5.2), and write 
2 = 1 
(M(K)) cmin 2 M(Cmin) = “cmin » (13.77) 


since for any given K m(K) can be further decreased to its actual minimum by relaxing 
the constraints (i)-(iv) of Theorem 2. Similar considerations apply to é(K). By writing 
(13.5.2) in terms of #x and substituting this latter into the above equation, we have 

(ACHR) coin 2 2704034, (13.78) 
that gives a new relation between ropelength, averaged over each Cyi,—family, and 
Cmin- Note that the coefficient 27/4 ~ 2.66 is independent of the knot family, and this 
result, in good agreement with (13.7.3), is still one of the best analytical results valid 
for any Cin to date (see, for instance, [4]). 
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13.8 Conclusions 


By using analytical results for the constrained minimum energy of magnetic knots 
obtained in [11], we have established a general functional relationship between min- 
imum energy levels of knots and links and internal twist h, given by an h“/3-power 
law. In the case of standard flux tubes our result is in good agreement with an earlier 
result by [7] obtained by a scaling argument. By using ropelength data obtained by the 
RIDGERUNNER tightening algorithm developed by [1] we have computed the groundstate 
energy spectra of the first 250 prime knots and 130 prime links; we have shown that 
the two spectra follow an almost identical logarithmic law. We have then extracted 
data on curvature and by computing the bending energy we have compared magnetic 
and bending energy spectra, finding a remarkable proportionality between end-states. 
By assuming that the number of knot types grows exponentially with the topological 
crossing number Cin, we have shown that this generic behavior can be justified by 
a general relationship between ropelength and crossing number, that is independent 
of the number of components (knots or links). Moreover, by considering ropelength 
averaged over a given knot family, we have established a new relation between this 
averaged ropelength and cue valid for knots/links of any Cin. However, as recent 
analytical work shows [9], these results cannot be considered fully general and fur- 
ther improvements are expected. In the context of magnetic relaxation, corrections are 
expected to come from finer realization of the analytical constraints (for instance, by 
allowing the cross-section to adapt to optimal shape) and from further improvements 
of the tightening procedure. In any case, our results demonstrate the great potential 
of magnetic energy methods to investigate and establish new relationships between 
energy contents and topological properties of complex systems. Moreover, by using 
curvature information of tight knots and links we can also estimate optimal proper- 
ties of 3D—packing and global geometry. These results can find useful applications 
in many disparate fields, from the study of structural complexity of physical and bi- 
ological filamentary systems [10, 18, 5], to applications in plasma physics and solar 
physics [15, 19]. They may also provide new insight into the ongoing search for funda- 
mental aspects in the mass-energy relations of high-energy theoretical physics [3, 2]. 
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